Unit 5 
Events occurring at random and 
population quantiles 


Introduction 


The term Bernoulli trial was introduced in Unit 3 to describe an 
experiment for which there are just two outcomes — success or failure, 

1 or 0, win or lose, and so on. In Unit 3, you met two probability 
distributions directly associated with a collection or sequence of 
independent Bernoulli trials in which the probability of success remains 
constant from trial to trial. The first, the binomial distribution, was 
developed as a model for the total number of successes in n trials. The 
second, the geometric distribution, is the distribution of the number of 
trials up to and including the first success. 


In Section 1 of this unit, we look again, but from a more practical 
viewpoint, at the distribution of the number of successes in a sequence of n 
Bernoulli trials when the number of trials is large, but the chance of 
success in each trial is small. In this case, where something rather unlikely 
has many opportunities to happen, the distribution of the number of 
successes has a very useful approximate form, as you saw in Section 4 of 
Unit 3: the probability model which gives the appropriate approximation 
to the binomial distribution in this sort of situation is the Poisson 
distribution. The result is sometimes known as Poisson’s approximation 
for the occurrence of rare events. It will be central to developments later in 
the unit. (Of course, the Poisson distribution is also an extremely 
important discrete probability distribution in its own right; some of its 
properties were given in Units 3 and 4.) 


A sequence of independent Bernoulli trials in which the probability of 
success remains constant from trial to trial is called a Bernoulli process. 
This is just the same notion as a collection or set of Bernoulli trials that 
you met earlier, but the word process is used to stress the sequential 
nature of trials in the contexts with which we are concerned in this unit. 
So a Bernoulli process is a useful model when an event either occurs 

(a success) or does not (a failure) at each of a clearly defined sequence of 
opportunities (trials) which take place one after another. The process is 
characterised by the assumptions that the probability that an event occurs 
is constant and is independent of the outcomes of previous trials. In this 
sense, occurrences of events are unpredictable: they occur at random 
points in the sequence of trials. 


However, many events occur in a random, unpredictable way in continuous 
time, where there is no notion of a trial. For example, domestic accidents, 
air crashes, floods and earthquakes may occur at any time, not just at a 
discrete set of opportunities. In Section 2, we look first at the distribution 
of the waiting time between events in a Bernoulli process, and then at an 
analogous continuous model for the waiting times between events that may 
occur at any time. The latter model is known as the exponential 
distribution. 


Introduction 





A rare event: weather conditions 


and the angle of the sun have to 
be just right for this waterfall in 
Yosemite National Park, USA, 
to look like a lava flow 
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In Section 3, another model is introduced, analogous to the Bernoulli 
process, but for events occurring at random in continuous time. This 
model is the Poisson process. It transpires that the number of events that 
occur in a Poisson process in any given interval of time has a Poisson 
distribution, and the waiting time between successive events in a Poisson 
process has an exponential distribution. The work of Subsection 3.2 then 
considers how we can decide, given data on the times at which a sequence 
of events occur, whether or not a Poisson process is a good model for the 
occurrences of these events. 


The final section of the unit, Section 4, is about something a bit different 
from the preceding sections of the unit: the population analogues of 
sample statistics such as the median and the quartiles. The results from 
this section will be applied to the models introduced in this unit, and will 
assume even greater importance in a number of other contexts later in the 
module. 


1 Bernoulli trials and the Poisson 
distribution 


In this section, we first investigate numerically the distribution of the 
number of successes in n Bernoulli trials when the number of trials is large 
but the chance of success, p, at each trial is small. This investigation gives 
a practical context to a result obtained mathematically in Section 4 of 
Unit 3: in the situation of many trials and small probability of success, the 
Poisson distribution provides a good approximation to the binomial 
distribution. The approximation is useful partly because Poisson 
probabilities are simpler than binomial probabilities when n is large and p 
is small, but also because the approximating Poisson distribution does not 
depend directly on the value of n. The latter is a feature that will assume 
central importance in Section 3. Finally, some further examples are given 
of situations where the Poisson approximation provides a useful model. 


The number of successes in n independent Bernoulli trials, each with the 
same probability of success, p, has the binomial distribution, B(n, p). We 
will now investigate this distribution for situations where n is large and p 
is small. The first situation we look at concerns bicycle usage and getting 
caught in a downpour. 





Example 1 Bicycle usage 


One disadvantage of cycling is the possibility of arriving at one’s 
destination drenched with rain, having been caught in a downpour. This is 
a very unpleasant experience, but actually its frequency has been 
estimated at only 15 times a year for a regular cyclist. 





1 Bernoulli trials and the Poisson distribution 


It is not entirely clear how the estimate of 15 times a year has been 
calculated. So if we wish to use it for calculating the probability, say, of 
escaping a drenching over the next month, then we need to make some 
further assumptions. To simplify the situation, we will assume that the 
probability of getting drenched does not change from one journey to the 
next, and that whether or not a drenching occurs on a journey is 
independent of experiences on previous journeys. That is, we will assume 
that cycle rides form a sequence of independent Bernoulli trials with 
constant probability of a drenching (a ‘success’!). 


We are told that a ‘regular cyclist’ gets drenched approximately 15 times a The simplifications we have 

year, on average, or 15/12 = 1.25 times a month. However, we do not made imply the rather 

know n, how many cycle journeys a month a regular cyclist makes, or the — Urealistic ae e the 

value of p, the proportion of rides that result in a drenching. In A aeseshc atlas es 
aa ; ; . the same for each month of the 

Activities 1 and 2 you are asked to investigate the effect of assuming year. 

different numbers of journeys a month on the distribution of the number of 


drenchings in a month. 


Activity 1 = Drenchings for Chris 


Chris uses her bicycle regularly for travelling to and from work and, 
occasionally, for shopping and social visits. Suppose that she gets drenched 
15 times a year on average. Her own assessment of cycle usage (which is 
fairly rough) works out at 50 journeys a month of a reasonable distance. 


(a) Estimate the proportion of Chris’s journeys that result in a drenching. 


(b) Suggest a probability model for X, the number of times a month that 
Chris gets drenched. 





(c) According to the model you suggested in part (b), what is the mean 
number of times that Chris gets drenched in a month? 


(d) Use your model to calculate the probability that over a month: 
(i) Chris does not get drenched at all 
(ii) Chris gets drenched twice. 


Chris’s guess at a monthly total of 50 rides is just that — a guess. Suppose 
that a better estimate is 60 rides a month. In this case, using the given 
average of 15 drenchings a year, the estimated probability of a drenching 
during a single ride is 

1 1B 
12x60 720 48° 
In this case, the assumed probability distribution for the number of 
drenchings a month is B(60, 1/48). 





p= 
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You calculated two of these 
probabilities in Activity 1. 
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Activity 2 Probabilities of drenchings 


Table 1 contains certain probabilities for drenchings assuming the binomial 
distribution B(50,1/40). It also contains some of the corresponding 
probabilities of drenchings assuming the binomial distribution B(60, 1/48). 
Complete the table by computing the missing probabilities of drenchings 
assuming the binomial distribution B(60, 1/48). 


Table 1 More probabilities of drenchings while cycling 
P(X =0) P(X=1) P(X =2) P(X =3) P(X>4) 


B(50,1/40) 0.2820 0.3615 0.2271 0.0932 0.0362 
B(60, 1/48) 0.3610 0.0932 0.0365 


What do you notice about these probabilities? 


In Activity 2, you saw that the probabilities of drenchings that were 
calculated assuming the higher estimate of 60 rides a month are not very 
different from those calculated assuming 50 rides a month. Also, the mean 
number of times that Chris gets drenched according to this second model 
is n x p = 60 x (1/48) = 1.25, the same as the mean of the model assuming 
50 rides a month (Activity 1(c)). 


Now suppose that Chris actually averages 40 cycle rides a month. In this 
case, the estimated probability of a drenching during a single ride is 

p = 1/32; and the assumed probability distribution of the number of 
drenchings a month is B(40,1/32), which also has mean 1.25. Some 
probabilities of drenchings for 40 rides a month are given in Table 2, 
together with those previously calculated for 50 and 60 rides a month. 


Table 2 Probabilities of drenchings while cycling 
PIX=0) P(X=1) P(X=2) P(X=3) P(X >24) 


B(50,1/40) 0.2820 0.3615 0.2271 0.0932 0.0362 
B(60,1/48) 0.2827 0.3610 0.2266 0.0932 0.0365 
B(40,1/32) 0.2808 0.3624 0.2280 0.0931 0.0357 


Notice that the values for 40 rides a month are close to those previously 
calculated for 50 and 60 rides a month. 


These calculations have all been based on the assumption that the mean 
number of drenchings a month for a regular cyclist is 1.25. If the number 
of cycle rides a month is n, then the corresponding estimate for the 
probability p of a drenching in a single ride is 

oO 5 15, 
P= i3xn n°? 
and the distribution used to model the number of drenchings a month is 
the binomial distribution B (n, 1.25/n). 


1 Bernoulli trials and the Poisson distribution 


The results in Table 2 suggest that, for values of n such as those used (50, 
60 and 40), probability calculations for the binomial distribution 

B(n, 1.25/n) lead to very similar results. Notice that, whatever the value 
of n, the mean of this binomial distribution is u = n x p = 1.25, the 
average number of drenchings a month. So the question arises: can we 
satisfactorily model the number of monthly drenchings for a regular cyclist 
using a random variable whose distribution depends only on a single 
parameter, the expected number p, rather than two parameters n and p? 
The answer is that we can. As you already know from Section 4 of Unit 3, 
the approximating distribution will be the Poisson distribution. The 
relevant result is given in the following box. 


Poisson’s approximation for rare events 


For large values of n and small values of p, a random variable 
following the binomial distribution, B(n, p), has approximately the 
same distribution as a random variable following the Poisson 
distribution with parameter np: 


B(n, p) © Poisson(np). The symbol ‘~’ is read as ‘is 
: s approximately the same 
Equivalently, if u = np, then distribution as’. 


B(n, u/n) ~ Poisson(u). 


The approximation improves as n increases. 


In Units 3 and 4, the Poisson parameter was denoted A, and in Unit 4 you 
saw that A is also the mean of the Poisson distribution. In this section, 
because we’re equating the means of the binomial and Poisson 
distributions and denotes the mean, it is convenient to use u to denote 
the Poisson parameter. 


In the following example, we compare the probabilities associated with the 
Poisson approximation with those already calculated using binomial 
models for the example of cyclist Chris getting drenched. 


Example 2 Poisson and binomial probabilities of drenchings 


Several probabilities calculated for the binomial distributions B(50, 1/40), 
B(60, 1/48) and B(40, 1/32) were given in Table 2. Each of these 
distributions has mean = np = 1.25, so the Poisson approximation is 


Poisson(1.25). This gives the following probabilities: The p.d.f. of the Poisson(:) 
p(0) =e}? ~ 0.2865, See ie ete 
195 
1.25 
p) = —= ~ 0.3581, 
—1.25 2 
1.25 
p(2) = Bai ~ 0.2238, 


and so on. Table 3 (overleaf) shows the binomial probabilities from 
Table 2 and the corresponding Poisson probabilities. 
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Table 3 Yet more probabilities of drenchings while cycling 
P(X=0) AX =D P(X =2] P(X =3) P(X =4) 


B(50, 1/40) 0.2820 0.3615 0.2271 0.0932 0.0362 
B(60, 1/48) 0.2827 0.3610 0.2266 0.0932 0.0365 
B(40, 1/32) 0.2808 0.3624 0.2280 0.0931 0.0357 
Poisson(1.25) 0.2865 0.3581 0.2238 0.0933 0.0383 


As you can see, the probabilities agree in the first decimal place and differ 
by at most 1 in the second decimal place, so the approximation is a 
reasonable one. 





So far we have looked at only one example involving a situation where the 
parameter n of the binomial distribution is large and the corresponding 
value of p is small. We have calculated probabilities only for binomial 
distributions with mean u = 1.25. Do similar results hold for binomial 
distributions with other means? It seems from the mathematics that they 
do, but whether or not the approximation is a good one depends on the 
values of n and p = u/n. 


To investigate the circumstances in which the approximation is a good one, 
we must consider a variety of values of n and p = u/n. However, you 
doubtless found the calculations involved in Activities 1 and 2, where n is 
large, to be time-consuming using a calculator. This is a situation where 
the computer comes into its own: it will do these sorts of calculations 
quickly and allow you to concentrate on the question being investigated. 
You are asked to carry out such an investigation in Chapter 9 of Computer 


Book A. 


Refer to Chapter 9 of Computer Book A for another example of 
the approximation of binomial probabilities by Poisson 
probabilities. 


So you have seen that when the parameter p of a binomial distribution is 
small and the parameter n is fairly large, the actual values of the binomial 
parameters are not critical to the values of the binomial probabilities (see, 
for example, Table 3). All that is required to find approximate values for 
these probabilities is the product np, which is the mean of the binomial 
distribution. You also know that the model which provides a good 
approximation to the binomial distribution when p is small and n is large 
is the Poisson distribution. (The Poisson distribution depends on only one 
parameter, np, rather than on the two separate parameters, n and p, of the 
binomial distribution.) A rough rule is now given for deciding when the 
Poisson distribution provides a good approximation to the binomial 
distribution. 


1 Bernoulli trials and the Poisson distribution 


A rough rule for using Poisson’s approximation 


e Ifnis large and p is small (n > 50 and p < 0.05, say), the 
approximation is good. 


e When p is small enough, the approximation is good even for quite 
small values of n. The smallest value of n for which the 
approximation is good decreases as the value of p decreases. 


It is unlikely that the rough rule given above agrees exactly with the rule 
you formulated when investigating binomial distributions in Chapter 9 of 
Computer Book A. Whether you regard the values of two probabilities as 
‘close’ is a subjective matter; and how close you require them to be 
depends on what they are to be used for. There are a few (rare) situations 
where the third or fourth decimal place is critical. On the other hand, it is 
unlikely that you would feel the assessment of the chances of several 
drenchings a month for a regular cyclist to be one of these! 


There are many other practical situations in which a binomial model with 
a small value of p would apply, and hence for which the Poisson 
distribution could also reasonably be used. A few more examples are 
described and investigated in the rest of this section. 





Example 3 Birthdays 


Suppose we have a group of 100 unrelated people. Let X be the random 
variable denoting the number of people in a group of 100 unrelated people 
whose birthday falls on 1 April. Then, making the assumption that births 
are independent and distributed evenly over the year, and ignoring Leap 
Day, the number of people in the group with a birthday on 1 April has a 
binomial distribution: X ~ B(100, 1/365). So the probability that, say, 
precisely one person in the group was born on 1 April is given by 


100 99 
P(X =1)= ( i ) (sts) (384) ~ 0.2088. 


The binomial distribution has mean u = 100 x 1/365 œ 0.2740. So, using 
Poisson’s approximation, we have 


P(X = 1) ~ 0.274e-°?4 ~ 0.2083. 


As you can see, the two probabilities are very close. 





Activity 3 Faulty resistors 


Suppose that a process manufacturing electrical resistors results in a small 
proportion, approximately 1 in 20, not working when they leave the 
factory. This is a tolerable risk accepted by purchasers: these resistors are 
not quality tested before they are packaged, because to do so would add 
considerably to their cost. The resistors are boxed in packages of 50. 


In other texts, you may see 
different rules. One such rule 
requires n > 100 and np < 10. 


Actually, the proportion of 
births on any day varies from 
day to day and from month to 
month. For example, during a 
holiday period, fewer births than 
usual are induced. The birth 
rate falls on Sundays, too. 
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What happens if you don’t get a 
seat? 





For example: ‘And now, the 
Superstore — unequalled in size, 
unmatched in variety, unrivalled 
inconvenience.’ Taken from 
Toseland, M. (2009) A Steroid 
Hit the Earth, Anova Books. 
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Suppose also that whether or not any particular resistor is faulty is 
independent of whether or not any other resistor is faulty, and that the 
probability that a resistor is faulty is the same for all resistors. 


(a) State an ‘exact’ model for the number of faulty resistors in a box. Use 
this model to calculate the probabilities that there is one faulty 
resistor in a box and that there are three faulty resistors in a box. 


(b) Write down an approximate distribution for the number of faulty 
resistors in a box, and use this to calculate probabilities corresponding 
to those you found in part (a). 


(c) Comment on any differences between the values you obtained in 
parts (a) and (b). 


Activity 4 Getting a seat 


On average, 1% of passengers who book seats on a particular flight do not 
turn up for the departure of the flight. The airline sells 200 tickets for this 
flight; only 198 seats are available. Find an approximate value for the 
probability that everyone who turns up for the flight will have a seat. You 
may assume that passengers who have booked a seat either turn up or do 
not turn up for the flight independently of each other. 


Exercises on Section 1 





Exercise 1 Typographical errors 


One of the necessary stages in getting a book published is the careful 
checking of the proofs (the first version of the printed pages). Usually 
there are typographical errors that need correcting. With a good publisher 
and printer, such errors, even in the first proofs, are not too numerous. 
Experience with one publisher of children’s books suggests an average of 
3.6 errors per page. 


Taking a children’s paperback and counting the number of words on a 
randomly selected page gave a word count of 320. If we assume there are 
320 words on a page, then an estimate of the probability that a word is 
mistyped is 
— 3.6 

P = 320 
If we assume independence from word to word, then X, the number of 
errors per page, has a binomial distribution with parameters n = 320 and 
p = 0.011 25. 


= 0.011 25. 


Of course, the number of words per page is not constant from page to 
page. Turning to another page, the number of words would almost 
certainly have been different — 360, perhaps. In this case, we would have 
assumed a binomial model for the number of errors per page with 
parameters n = 360 and p = 3.6/360 = 0.01. 


2 Waiting times between events 


Table 4 contains some probabilities for the number of errors on a page 
using binomial models with n = 320 and n = 360. 


Table 4 Probabilities of errors 
P(X=0) P(X=1) P(X=2) P(X =3) P(X >24) 


B(320,0.01125) 0.0268 0.0975 0.1769 0.2134 0.4854 
B(360, 0.01) 0.0268 0.0976 0.1769 0.2133 0.4854 


(a) Suggest an approximate model for the number of errors on a page. 


(b) Use this approximate model to calculate probabilities corresponding to 
those given in Table 4. 


(c) Comment on any differences between the results given in Table 4 and 
those you obtained in part (b). 





Exercise 2 Blue-yellow colour vision deficiency 


Blue-yellow colour vision deficiency, or tritanopia, is very rare, and it 
occurs equally in men and women. The probability that a person has this 
type of colour vision deficiency is about 1 in 10000, or 0.0001. 


(a) Use an exact model to find the probability that there will be exactly 
one person with tritanopia in a group of 100 people. 


(b) Use a Poisson model to find an approximate value for the probability 
that there will be exactly one person with tritanopia in a group of 
100 people. Comment on the comparison between the probabilities 
calculated in parts (a) and (b). 


2 Waiting times between events 


Many events occur repeatedly, but ‘at random’. For example, if a fair 
six-sided die is rolled repeatedly, sometimes a six is obtained (a success) 
and sometimes it is not. Similarly, if an arrow is aimed at a target on 
many occasions, some shots will hit the centre of the target (successes) and 
some will miss. In either case, an event (a success) may occur at each of a 
discrete set of opportunities (trials) taking place one after the other. It is 
not possible to predict at which trials the event will occur and so, in this 
sense, the events occur ‘at random’. 


However, there are many situations in which events seem to occur 
unpredictably, that is ‘at random’, but where there is no notion of a 
specific trial or opportunity when an event may occur. For instance, events 
such as floods, earthquakes and accidents may occur at any time. 


Two models for the times between events that occur ‘at random’ are 
discussed in this section: one discrete probability distribution and one 
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continuous probability distribution. The discrete distribution — which is 
one of the ones that you have met already — is a model for the times 
between successive events in situations where events may occur only at a 
discrete set of opportunities or trials; this is discussed in Subsection 2.1. 
The continuous distribution, which is new to you and is discussed in 
Subsection 2.2, is a model for the times between events that may occur at 
any time. 


2.1 Bernoulli processes 


The notion of a collection or sequence of independent Bernoulli trials for 
which the probability of success remains constant from trial to trial was 
introduced in Unit 3. Such collections of Bernoulli trials were fundamental 
to the development of the binomial and geometric distributions in that 
unit. Thought of as sequences — that is, when trials take place sequentially 
through time — these collections of Bernoulli trials are called Bernoulli 
processes. The formal definition of a Bernoulli process is given in the box 
below. 


Bernoulli processes 


A Bernoulli process is a sequence of Bernoulli trials in which 
e trials are independent 


e the probability of success remains the same from trial to trial. 


In a Bernoulli process it is assumed that an event either occurs, or does 
not occur, at each of a clearly defined sequence of opportunities (called 
trials). Three typical realisations of a Bernoulli process — that is, three sets 
of observed outcomes of a Bernoulli process — are shown in Table 5. In 
each of these three realisations, the parameter p, the probability of a 1 

(a success) at each trial, was equal to 0.3. In each case, the outcomes of 
the first 25 trials in the realisation have been recorded. 


Table 5 The first 25 trials in three realisations of a Bernoulli process 


(a) 0 1000001101 10001000210212100 
(ib) oO OT OO LL 100 OOO LOL Oot oo 0 
(c) 0 1101101000010000110100000 


In Unit 3, you saw that X, the number of trials up to and including the 
first success in a Bernoulli process, has a geometric distribution with 
parameter p (and mean 1/p). In the three realisations in Table 5, the 
observed values of X are 2, 4 and 2, respectively. But what is the 
distribution of the number of trials from one success to the next, that is 
from the first success to the second, or from the second success to the 
third, and so on? 


One of the properties of the Bernoulli process is that the probability of a 
success at any trial remains the same and is equal to the probability of 


2 Waiting times between events 


success at all preceding trials, whatever the history of the process up to 
that trial. So even a long run of failures (that is, of 0s) does not make the 
probability of a success (a 1) at the next trial any more likely. There is no 
question of ‘luck being bound to change’. Similarly, a long run of 1s (a 
‘run of good luck’) does not alter the probability of a 1 at subsequent 
trials. The incidence of 1s may be regarded in a special sense as ‘random’: 
knowledge of the whole history of the process so far — that is, the outcome 
of all the trials so far — does not alter the future development of the 
process, or the probability laws describing its future development. 

A statistical experimenter, who knows the probability of success in each 
trial and is about to embark on trial number 1001, is not helped at all in 
her assessment of what might or might not happen at that trial by 
knowing what has happened at any or all of the preceding one thousand 
trials. This is known as the memoryless property of the Bernoulli process. 


Since a Bernoulli process has this memoryless property, it follows that at 
any point in a Bernoulli process, the number of trials from that point until 
the next success has the same probability distribution as X, the number of 
trials from the start of the process until the first success. So, in particular, 
the number of trials from one success to the next has a geometric 
distribution with parameter p, written G(p). Hence we can use X ~ G(p) 
to represent the number of trials from one success up to and including the 
next — the ‘waiting time’ to the next success — not just for the number of 
trials from the start up to and including the first success. 


In each of the realisations in Table 5, several independent values of X can 
be observed. In realisation (a), they are 2, 6, 1, 2, 1, 4, 4, 2, 1; and the 
next value of X must be 3 or more. Notice that, to match what we did 
when considering the number of trials from the start to the first success, 
after each success we count the number of failures, 0s, plus one for the next 
success, 1; that is, we include in our waiting time the trial on which the 
next success happens. 


Activity 5 Observed values of X 


Write down the observed values of X in each of realisations (b) and (c) in 
Table 5. In each realisation, what can you say about the next value of X? 


Recall from Unit 3 that the geometric distribution has p.m.f. 
p(z) = P(X = 2) =(1—p)*'p 

and c.d.f. 
F(x) = P(X < z)=1- (1- p)”, 


respectively, on range x = 1,2,.... 


277 


Unit 5 Events occurring at random and population quantiles 





278 


Activity 6 Times between successes 


Suppose that the probability of a success in each trial in a Bernoulli 
process is equal to 0.3. 


(a) Find the probability that the number of trials from one success to the 
next is equal to (i) 2, (ii) 6. 


(b) Find the probability that the number of trials from one success to the 
next is 3 or more. 


Activity 7 Obtaining your next six 


Suppose that in a board game you roll a fair six-sided die repeatedly. 


(a) Find the probability that after obtaining a six you will have to roll the 
die exactly four times to obtain another six. 


(b) Find the probability that after obtaining a six you will have to roll the 
die at least five times to obtain your next six. 





Example 4 Intervals between serious earthquakes 


Table 6 contains some data on serious earthquakes worldwide. An 
earthquake has been included if over 1000 people were killed. The data 
cover the period from 15 August 1950, when 1530 people were killed in the 
Assam-—Tibet earthquake, to 11 March 2011, when 1479 people were killed 
in the Tohoku earthquake and tsunami, in Japan. Altogether 68 serious 
earthquakes occurred in this period. Table 6 contains the 67 ‘waiting 
times’ in days between successive earthquakes. The numbers should be 
read across the rows. 


Table 6 Time intervals between serious earthquakes (days) 


186 77 682 449 91 1027 164 808 83 832 
328 1120 743 328 163 83 64 356 324 257 
503 232 252 151 142 32 120 100 561 755 

44 200 47 503 320 690 386 147 533 109 
560 26 460 420 291 474 131 643 71 270 
115 240 204 34 493 423 422 219 366 92 
194 230 717 506 104 91 332 


(Source: adapted from the Global Significant Earthquake Database 
of the US National Geophysical Data Center) 


There are no instances of two or more serious earthquakes occurring on the 
same day, though such a coincidence is not impossible. So it might be 
reasonable, at least as a first approximation, to regard each day as an 
opportunity or trial during which a serious earthquake might or might not 
happen. 
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Activity 8 Modelling the times between serious earthquakes 


Suppose that the occurrences of serious earthquakes are modelled as a 

Bernoulli process. 

(a) What assumptions about the occurrences of serious earthquakes are 
made? 

(b) What distribution is used to model the time between serious 
earthquakes? 


(c) A histogram of the data in Table 6 is shown in Figure 1. By 
considering the shape of this histogram, say whether or not you think 
the model you proposed in part (b) might be a reasonable one. 
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Figure 1 Time intervals between serious earthquakes (days) 


(d) If a Bernoulli process is a good model, how could you use the data to 
estimate the probability that on any individual day a serious 
earthquake will occur? 





Example 5 /ntervals between serious earthquakes, continued 


For the data in Table 6, the mean time in days between serious 
earthquakes is 

186+ 77+---+332 _ 22120 

67 © 67 
(which is just under 11 months). So, as suggested in the solution to 
part (d) of Activity 8, an estimate of the probability that a serious 
earthquake occurs somewhere in the world on any individual day is given 
by the reciprocal of this sample mean; that is, an estimate of p is 
67 


~ 330 





Serious damage caused by a 
major earthquake 


This is another instance of using 
a histogram to portray discrete 
data with a large range. 


Reminder: you will notice that 
the module authors are happy to 
allow the number of decimal 
places in the results of 
calculations to vary, as you may, 
in ways that seem appropriate to 
the context at hand. 
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The estimated probability of a gap between serious earthquakes exceeding 
three years (say) can be found using the c.d.f. of X, the waiting time 
between successive serious earthquakes. Ignoring the possibility of a leap 
year, there are 1095 days in three years, so the probability of a gap longer 
than three years is given by 


P(X > 1095) = 1 — F(1095) = 1 — (1 — (1 — p)!) 
67 1095 
=(1—p) = (1 - sun) ~ 0.0361, 


or about 1 in 28. Notice that, for the period covered by the data in 

Table 6, one interval out of the 67 exceeded three years: there was an 
interval of 1120 days between the twelfth and thirteenth earthquakes 
recorded. (The twelfth was on 26 July 1963 in Skopje, Macedonia, and the 
thirteenth, in Varto, Turkey, was on 19 August 1966.) There were fewer 
gaps of more than three years than are predicted by the model. 





Activity 9 Gaps longer than two years 


Following on from Examples 4 and 5, according to the geometric model, 
what is the probability that the interval between serious earthquakes will 
exceed two years? (You may ignore the possibility of a leap year.) How 
many cases of gaps longer than two years between serious earthquakes are 
recorded in Table 6? 


Both as a summary and for comparison with results further on in this unit, 
the main properties of a Bernoulli process are highlighted below. 


The Bernoulli process: two main results 


For a Bernoulli process, that is, a sequence of independent Bernoulli 
trials with constant probability of success, p: 


e the random variable which represents the number of successes in 
n trials has a binomial distribution with parameters n and p 


e the waiting time from after one success up to and including the 
next success has a geometric distribution with parameter p. 


An underlying assumption implicit in using the geometric distribution to 
model the intervals between serious earthquakes is that at most one serious 
earthquake can occur on any day: each day was regarded as a trial at 
which an earthquake might or might not happen. However, although it is 
unlikely that two or more serious earthquakes will occur on the same day, it 
is not impossible. In reality, an earthquake may occur at any time on any 
day. Also, the exact length of the interval from one serious earthquake to 
the next is unlikely to be a whole number of days, but is measured in days, 
hours and minutes. A continuous model for the times between earthquakes 
is needed. In Subsection 2.2, such a continuous model is discussed; this 
model is the continuous-time analogue of the geometric distribution. 


2 Waiting times between events 


2.2 The exponential distribution 


The histogram in Figure 1 suggests that a suitable continuous probability 
model for the intervals between serious earthquakes will be one with a 
highly skew probability density function, starting high for values near 0 
and tailing off for longer waiting times (rather as the geometric probability 
mass function does). Many probability density functions could be 
constructed with this sort of shape. However, one of these probability 
density functions turns out to belong to the distribution with properties 
most like those of the geometric distribution, and therefore forms a possible 
model for the distribution of the waiting time from one serious earthquake 
to the next. This is the exponential distribution. It is defined as follows. 


The exponential distribution 


A random variable X is said to have an exponential distribution 
with parameter A, where > 0, if it has probability density 
function 


(QS ae S10). (1) 


This is written X ~ M(A). Here, e is the exponential function which 
also appears in the Poisson probability mass function. 


The exponential distribution is the continuous analogue of the geometric 
distribution. The probability density function of a typical exponential 
distribution is shown in Figure 2(a). Notice the similarity between its 
general shape and that of the probability mass function of a geometric 
random variable (see Figure 2(b)). The exponential p.d.f. is in fact a 
decreasing function of x for any value of its (positive) parameter A. 
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Figure 2 (a) The p.d.f. of a typical exponential distribution; (b) the p.m.f. of a typical geometric 
distribution 


You will recall that for a continuous random variable, the cumulative 
distribution function is very useful for calculating probabilities. The c.d.f. 
of an exponential distribution is given in the box below. 
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The c.d.f. of an exponential random variable 


If the random variable X has an exponential distribution with 
parameter A, then its c.d.f. is given by 


JG) Sao" ao! (2) 


As usual, to derive the c.d.f. of a continuous distribution from its p.d.f., 
integration is used. You will do this for yourself in Activity 11. First, 
though, in the case of the exponential distribution, it is clear from 
Equation (1) that you need to be able to integrate an exponential function. 
Integrating an exponential function is a fairly straightforward thing to do, 
the results you need being given next. 










a REDUCE D 
AEA 





“It’s got more special-function keys than 
you'll find on many of the larger models: 
square root, cosine, logarithmic, integral and 
exponential keys. I'd say that more than 
makes up for the fact that it doesn’t have the 
number nine!” 
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2 Waiting times between events 


Integrating the exponential function 


Suppose we want to integrate e°” where a is a non-zero constant. The 
formula for its indefinite integral is 


gue 
ie dz = — +e 
a 


In words, the integral of the exponential function e°” is the same 
exponential function, e°”, divided by the constant a. Definite 
integrals of e°” therefore have the form 


ie Bo de= =] T2 = 1 (ea PA ey, 
z 


1 a T1 a 





Example 6 Integrating exponentials 


To illustrate applying these rules, we have 


2y 
Ww _ € 
fe acs +c 


and 





Activity 10 /ntegrating exponentials 


Find each of the following integrals. 


(a) / Pde 
(b) | tda 
(c) [>a 


Activity 11 From exponential p.d.f. to exponential c.d.f. 


The exponential p.d.f. is given by Equation (1). Confirm that the c.d.f. of 
the exponential distribution is given by Equation (2). 
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This result may be obtained 
using integration but you need 
not see the details. 
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The exponential c.d.f. corresponding to the exponential p.d.f. in 
Figure 2(a) is shown in Figure 3. 


14 








x 


Figure 3 The c.d.f. of the exponential distribution whose p.d.f. is shown 
in Figure 2(a) 


As you can see, the family of exponential distributions is indexed by a 
single parameter À > 0. But what does this parameter represent? 


For the geometric distribution, the mean is given by 1/p, where p is the 
parameter of the distribution. The parameter À of an exponential 
distribution has a similar role: the mean p of an exponential distribution is 
equal to 1/X. 


The mean of an exponential distribution 


If the random variable X has an exponential distribution with 
parameter A, then 


B(X) = 5. (3) 


So A is equal to the reciprocal of the mean u. Hence, returning to the 
earthquake example of Subsection 2.1, if represents the mean waiting 
time between serious earthquakes, then A represents the average rate at 
which serious earthquakes occur. Indeed, is often referred to as the rate 
parameter. Given data, this result can be used to estimate the 

parameter A. 





Example 7 More on modelling the times between serious earthquakes 


The mean of the waiting times in Table 6 is 22120/67 ~ 330 days. If an 
exponential distribution is used to model the variation in the waiting times 
between serious earthquakes, then an estimate of the parameter A of the 
distribution is 67/22120 ~ 0.0030. 


In Subsection 2.1, a geometric distribution was used to model the waiting 
times between serious earthquakes. Using this model we found that, for 
instance, the probability of a gap between serious earthquakes exceeding 
three years (i.e. 1095 days) was approximately 0.0361. Using an 


2 Waiting times between events 


exponential model with parameter A = 67/22120, the corresponding 
probability is 


P(X > 1095) = 1 — F(1095) 
=1- (1 — pee) using Equation (2) 
ag HEE ~ 0,0363. 


So, to three decimal places, the exponential model gives the same result as 
the geometric model. 





Activity 12 Understanding and using the exponential model 


(a) The p.d.f. of the exponential model with parameter A = 67/22120 is 
shown in Figure 4. 
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Figure 4 The p.d.f. of the exponential distribution with A = 67/22120 


(i) Mark on a copy of Figure 4 the area corresponding to the 
probability that the interval between serious earthquakes will 
exceed two years (that is, 730 days). 


(ii) Use the formula to find what is, according to this model, the 
probability that the interval between serious earthquakes will 
exceed two years. Comment on the comparison between this 
probability and that provided by the geometric distribution in 
Activity 9. 


(b) According to this exponential model, what is the probability that the 
gap between successive earthquakes will be a week or less? 


(c) Use the exponential model to find the proportion of waiting times 
between serious earthquakes that are longer than average, that is, the 
proportion that exceed 22120/67 ~ 330 days. Compare this proportion 
with the proportion of waiting times in Table 6 that exceed 330 days. 
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Presumably, if you can catch one 
of these number 55 buses you’d 
have been waiting longer than 
average for one of them to come 
along! 


Like the mean, the variance may 
also be found using integration. 
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As you know from Subsection 4.3 of Unit 2, the probability that a random 
variable X takes values between, say, xı and z2, 71 < £2, is also available 
from the c.d.f. of X: 


Play < X < 23) = F (a2) — F(a). 
Applying this to the exponential distribution, 
P(a1 < X < #2) =1-e 7% —(1-e*™), 


which reduces to the following formula for probabilities of lying within 
intervals under this model. 


Probabilities associated with an exponential random variable 


If the random variable X has an exponential distribution with 
parameter A, then 


PAGO em cm 2, (4) 


Activity 13 More on understanding and using the exponential model 


(a) Figure 4 showed the p.d.f. of the exponential model, with 
A = 67/22120, that we are using for the data on time intervals between 
serious earthquakes. Mark on a copy of that figure the area 
corresponding to the probability that the interval between serious 
earthquakes will be between one and two years (that is, between 365 
and 730 days). 


(b) Use Equation (4) to find what is, according to the model, the 
probability that the interval between serious earthquakes will be 
between one and two years. 


(c) According to the exponential model, what is the probability that the 
gap between successive serious earthquakes will be between a week and 
a month (taken to be 30 days)? 


Activity 14 Longer than average waiting times 


Suppose that the waiting times between events occurring at random may 
be modelled by an exponential distribution with parameter A. According 
to the model, what proportion of waiting times will be longer than the 
mean waiting time? 


In this subsection, the exponential distribution has been defined. A 
formula for the c.d.f. of an exponential distribution has been given; and 
the result that the mean of an exponential distribution with parameter A is 
1/A has been stated without proof. A formula for the variance is also 
stated, without proof, in the box below, where several properties of 
exponential distributions are summarised. 


2 Waiting times between events 


Properties of exponential distributions 


If the random variable X has an exponential distribution with 
parameter A, X ~ M(A), then 


e the p.d.f. of X is 
fi@\=xe ae 

e the c.d.f. of X is 
P@)\=1S=e2". 2 50 


e the mean of X is 


ll 
= exe) = > 
u= E(X) F, 
e the variance of X is 
il 
e and hence the standard deviation of X is 
1 
B= VCD = = (5) 


Notice that the mean and standard deviation of an exponential 
distribution are equal. This provides a method for checking quickly, given 
data, whether an exponential model is worth considering: if the sample 
mean and sample standard deviation differ greatly, then an exponential 
model is not appropriate. Of course, even if the sample mean and sample 
standard deviation are roughly equal, an exponential distribution may not 
be a suitable model. For instance, an exponential model would not be 
appropriate if the shape of a histogram of the data was very different from 
that of an exponential p.d.f. 


Exercises on Section 2 


Exercise 3 Explosive volcanic eruptions 


The histogram in Figure 5 (overleaf) represents the 55 intervals in months 
between 56 successive major explosive volcanic eruptions in the northern 
hemisphere. The dataset includes all such eruptions that took place 
between the beginning of 1851 and the beginning of 1985. (The first 
occurred in March 1853 and the last in July 1983.) 


The sample mean is 28.4 months and the sample standard deviation is 
31.7 months. 





(a) Explain briefly whether or not you think an exponential distribution 
might be a reasonable model for the intervals between major explosive 
eruptions in the northern hemisphere. 


(b) Which model do you think is more appropriate in this case — a 
geometric distribution or an exponential distribution? (You will 
investigate using both models in Exercises 4 and 5.) 
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Figure 5 Intervals between eruptions 


(Source: adapted from Solow, A.F. (1991) ‘An exploratory analysis of occurrence 
of explosive volcanism in the northern hemisphere’, Journal of the American 
Statistical Association, vol. 86, no. 413, pp. 49-54) 





Exercise 4 Using a geometric model 


Despite the solution to Exercise 3(b), this exercise concerns using a 
geometric model to model the intervals between successive major explosive 
volcanic eruptions in the northern hemisphere. 


(a) What assumptions about the occurrences of major explosive volcanic 
eruptions in the northern hemisphere are you making by using a 
geometric model? 


(b) Use the information given in Exercise 3 to find an estimate for the 
parameter p of the geometric model. 


(c) According to the geometric model, what is the probability that the 
interval between successive major explosive volcanic eruptions in the 
northern hemisphere will exceed ten years? How many cases of 
intervals exceeding ten years were there? 





Exercise 5 Using an exponential model 


This exercise concerns using an exponential model for the intervals between 
successive major explosive volcanic eruptions in the northern hemisphere. 


(a) Use the information given in Exercise 3 to find an estimate for the 
parameter of the exponential model. 


(b) According to the exponential model, what is the probability that the 
interval between successive major explosive volcanic eruptions in the 
northern hemisphere will exceed ten years? Compare this result with 
the probability you obtained using a geometric model in Exercise 4. 


(c) According to the exponential model, what is the probability that the 
interval between successive major explosive volcanic eruptions in the 
northern hemisphere will be between one and five years? 


3 The Poisson process 


In this section a process rather similar to the Bernoulli process is 
introduced. However, the process considered here develops in continuous 
time, rather than as a sequence of 0s and 1s occurring at a discrete set of 
trials (as in the Bernoulli process). This process is called the Poisson 
process. 


In Subsection 3.1 the Poisson process and some of its properties are 
discussed. But given data on the occurrences of events, how can we decide 
whether a Poisson process is a reasonable model? In Subsection 3.2 we 
take a look at this problem. 


3.1 A continuous-time analogue of the Bernoulli 
process 


A Bernoulli process consists of a sequence of trials which result in a 
sequence of Os and 1s occurring at a discrete set of points. In the sorts of 
situation considered in this section, events occur after intervals, or waiting 
times, that are random in the sense that knowing how many events have 
occurred so far, or how long it is since an event last occurred, is of no use 
in forecasting when the next event will occur. These waiting times 
correspond roughly to sequences, or runs, of Os in a realisation of a 
Bernoulli process: when an event happens, it is rather as though a 1 has 
been recorded. But the waiting times are not integers, and successive 
events are not restricted to a discrete set of opportunities. 


Here are some further examples of the sorts of phenomenon that could be 
modelled in this way, in addition to the occurrences of earthquakes and 
volcanic eruptions. 





Example 8 Random events occurring in continuous time 


(a) Even in a machine shop where a regular maintenance programme is 
carried out, there are occasional machine breakdowns. However, their 
occurrence cannot be forecast. Sometimes the repair staff are 
overworked, and sometimes there is nothing for them to do. All that is 
known from repair records is that the rate at which breakdowns occur 
remains constant. 


(b) Roughly twice a year, extreme wind conditions sufficient to ground 
aircraft are recorded at a small airport on the south coast of England, 
but their time of occurrence does not seem to be linked to the time of 
the year. 


(c) Spam emails arrive in mailboxes seemingly at random and, over long 
stretches of time, at an apparently constant rate. (Emails and other 
electronic messages which are of interest to the receiver tend to arrive 
in a less random pattern as they are interacted with.) 


3 The Poisson process 


In the context of events 
occurring at random in time, the 
rate is the average number of 
occurrences per unit time. 
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(d) In England, the Highways England Traffic Officer Service monitors 
motorways, and has breakdowns reported to it. There are many more 
during the day than there are at night, and noticeably more on Friday 
and Sunday evenings than on Saturday evenings. However, within 
shorter periods, such as, for example, the middle of the day or the 
early hours of the morning, the rate at which breakdowns are reported 
may be regarded as constant. And within any such period, there does 
not seem to be any pattern to the times at which the reported 
breakdowns occur. 


(e) Arrivals at a hospital accident and emergency department occur 
without warning but at rates that, depending broadly on the time of 
day and the day of the week, are reasonably stable. 


Note, however, that for many situations in which there is some uncertainty 
about when events will occur, those events do not occur completely at 
random in the kind of way exemplified in Example 8. Hence such 
situations will not be amenable to the same kind of statistical modelling. 
Here are some examples that illustrate this. 





Example 9 Events showing a non-random pattern 


(a) The owner of a battery-operated garden thermometer has noticed 
that, over the past six years, batteries advertised to last six months 
have, to within two or three weeks, lasted six months. 


(b) Manufacturers of car tyres build into their tyres a lifetime with as 
small a variability as possible. It is possible to forecast roughly when a 
particular set of tyres will have to be replaced. 


(c) Patients at a dental surgery arrive by appointment and are seen at 
roughly the time stated on their appointment card or email. 





Let us return to processes, like those in Example 8, that can be modelled 
as events occurring at random. A continuous-time analogue of the 
Bernoulli process should possess properties similar to those of a Bernoulli 
process. For instance, only one event can occur at each trial in a Bernoulli 
process; that is, events should occur singly. Also, the probability that an 
event occurs at each trial remains constant, so the rate at which events 
should occur remains constant. In addition, a Bernoulli process has the 
memoryless property: the occurrence of events in the future does not 
depend in any way on how many events have occurred in the past or on 
when they occurred. 


In fact, these properties can be used to define a continuous-time analogue 
of the Bernoulli process. This process is called the Poisson process; its 
definition is given in the following box. 


3 The Poisson process 


The Poisson process 


The Poisson process is a model for the occurrence of events in 
continuous time in which the following assumptions are made. 


e Events occur singly. 
e The rate of occurrence of events remains constant. 


e The incidence of future events is independent of the past. 


Events in a Poisson process can be visualised as occurring along a line 
representing time (or distance) as shown in Figure 6. 





e—x— xx xxx x—* x- x—* xx x—> t (or x) 
Figure 6 Events occurring in continuous time 


For a Bernoulli process, you have seen that the number of successes (or 
events) in n trials has a binomial distribution, and the waiting time from 
one success to the next has a geometric distribution. What are the 
corresponding results for a Poisson process? That is, what is the 
distribution of the number of events that occur in an interval of length t? 
And what is the distribution of the waiting time from one event to the 
next? 


The number of events in an interval of length ¢ 


It can be shown that the number of events that occur in an interval of 
length t has a Poisson distribution. To prove this requires mathematical 
techniques beyond the scope of this module. However, an informal 
argument leading to the required result is given in Example 10. 





Example 10 Calls at a switchboard 


Suppose that the arrivals of calls at a telephone switchboard may be 
modelled as a Poisson process. This means that calls are received at 
random at a constant rate. Let this rate be A per unit time. Then the 
average number of calls received in an interval of length t is À x t. 


Now imagine that this time interval is split into a large number n of very 
short time intervals, each of length t/n. 


Provided that n is large enough, the probability that more than one call is 
received in any interval of length t/n may be regarded as negligible (since 
events occur singly in a Poisson process). 





Since the rate at which calls are received remains constant, the probability e2 O 
p that a call is received in a short interval remains constant from one 
interval to the next. We can, therefore, think of the arrivals of calls as a 
sequence of n independent Bernoulli trials with probability p of a success 
(a call) at each trial. 


Telephone switchboards, old and 
new 


291 


Unit 5 Events occurring at random and population quantiles 


From here on, it is useful to 
revert back to using A (instead 
of u) for the Poisson parameter. 
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It follows that the total number of calls received in n intervals of length 
t/n (the trials) has a binomial distribution with parameters n and p, 
B(n,p), which has mean np. However, the mean number of calls received 
in an interval of length t is At, so p = At/n. Thus, provided that n is large, 
the distribution of the number of calls received in an interval of length t is 
binomial, B(n, At/n). Using Poisson’s approximation for rare events, this 
distribution may be approximated by a Poisson distribution with 
parameter At. 


In fact, this result is exact, not approximate. If the arrivals of calls may be 
modelled as a Poisson process with rate ÀA and X is a random variable 
representing the number of calls received in an interval of length t, then X 
has a Poisson distribution with parameter At: X ~ Poisson(At). 


Notice that the parameter of the probability distribution of the number of 
calls received in an interval depends on both the length of the interval, t, 
and the rate at which the calls are arriving per unit time, A, but only 
through the single quantity that is their product, At. 





The result described in Example 10 for calls arriving at a switchboard 
generalises as follows to events in any Poisson process. 


e Ifthe occurrences of events may be modelled as a Poisson process with 
rate A, and X is a random variable representing the number of events 
that occur in an interval of length t, then X has a Poisson distribution 
with parameter At: X ~ Poisson(At). 


This result states that the number of events in an interval has a Poisson 
distribution and that the parameter of this distribution is equal to the rate 
at which events occur per unit time multiplied by the length of the 
interval. That is, the value of the parameter is the expected number of 
events that will occur in the interval. The calculation of the parameter of 
the Poisson distribution is illustrated in Example 11. The use of this 
distribution to calculate probabilities associated with events in a Poisson 
process will be illustrated shortly. 


Example 11 Telephone calls 


Suppose that the arrivals of telephone calls at a solicitor’s office may be 
modelled as a Poisson process with rate 9 per hour, that is, with A = 9. 
Then, for instance, the number of calls received in t = 15 hours has a 
Poisson distribution with parameter 


At = (9 per hour) x (14 hours) = 13.5; 


and the number of calls received in a ten-minute period has a Poisson 
distribution with parameter 


At = (9 per hour) x ($ hour) = 1.5. 





3 The Poisson process 


Activity 15 The number of events in an interval 


(a) Suppose that the arrivals of telephone calls at a call centre on a 
weekday morning may be modelled as a Poisson process with rate 
3 per minute. Find the distribution of each of the following. 


(i) The number of calls that arrive in a five-minute period. 
(ii) The number of calls that arrive in a quarter of an hour. 


(b) Suppose that arrivals at a hospital accident and emergency department 
on a weekday afternoon may be modelled as a Poisson process with 
rate 5 per hour. Find the distribution of each of the following. 


(i) The number of arrivals in an afternoon (34 hours). 


(ii) The number of arrivals in a 15-minute period. 


The waiting times between successive events 


Now consider the waiting times between successive events in a Poisson 
process. Since the waiting time may take any positive value, a continuous 
model is required. In addition, we require a probability distribution with 
the memoryless property, that is, with the property that the occurrence of 
events in the future does not depend on how many events have occurred in 
the past or on when they occurred. It can be shown, though this will not 
be done here, that the only continuous distribution with the memoryless 
property is the exponential distribution. As mentioned in Section 2, the 
exponential distribution is the continuous-time analogue of the geometric 
distribution. 


In fact, assuming the result that the number of events that occur in an 

interval of length t in a Poisson process with rate A has a Poisson 

distribution with parameter At, it can be shown that the waiting time T We now denote the waiting time 
between successive events has an exponential distribution with random variable by T to avoid 
parameter A. The method involves finding the c.d.f. of the waiting time. confusion with the number of 
The argument is rather neat and goes as follows. STONE remo VAARA 


The c.d.f. of T is defined to be 
FGP PTs == Pir >t: 


The waiting time T from one event to the next is greater than t if and only 
if no event occurs in the interval of length t following the first of the two 
events. But the number of events that occur in an interval of length t is a 
random variable X, where X ~ Poisson(At). So 


PIT >t)= P(X = 0): 
But P(X = 0) =e, so 
P(T >t) =~, 
and hence 
F(t) =1- P(T >t)=1-e¥. 
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® 


See Examples 4, 5 and 7. 


This is consistent with serious 
earthquakes happening around 
every 11 months on average. 


294 


This is the c.d.f. of an exponential distribution with parameter À, so 
T~ M()). 


The two results just discussed are stated formally in the following box. 


The Poisson process: two main results 


Suppose that events occur at random at rate A per unit time in such 
a way that their occurrence may be modelled as a Poisson process. 


e The random variable X, which represents the number of events 
that occur during a time interval of length t, has a Poisson 
distribution with parameter At: 


X ~ Poisson(At). (6) 


e The waiting time T between successive events has an exponential 
distribution with parameter A: 


T~ MON, (7) 


The parallels between the Bernoulli process and the Poisson process are 
therefore complete. These parallels are emphasised in the following 
screencast. 


Screencast 5.1 The Bernoulli and Poisson processes 


If a Poisson process is thought to be a reasonable model for the occurrence 
of events, then the two results in the box above may be used to calculate 
probabilities associated with the events. Their use is illustrated in 
Example 12. 





Example 12 Modelling occurrences of serious earthquakes 


In Section 2 an exponential distribution was used to model the waiting 
times between serious earthquakes. For the data in Table 6, the mean time 
between earthquakes was 22120/67 days, or approximately 330 days. 


If we assume that the occurrences of serious earthquakes may be modelled 
as a Poisson process, then an estimate for the rate À of the process per day 
is 

67 


Distributional Result (6) may be used to answer questions concerning the 
number of serious earthquakes in any period of fixed length. For example, 
in a typical year of 365 days, X, the number of serious earthquakes that 
occur, has a Poisson distribution with parameter 


67 
At = os per day) x (365 days) ~ 1.10556. 


Since the mean of a Poisson distribution is equal to its parameter, this is 
also, according to the Poisson process model, the mean number of serious 
earthquakes per year. The probability that exactly three serious 
earthquakes occur in a typical year is 
ee. 10556)" 

3! 
and the probability that two or more serious earthquakes occur in a typical 
year is 

P(X > 2) =1- (P(X =0)4+ P(X =1)) 
= 1 — e7 110556 (1 + 1.10556) ~ 0.3030: 


P(X =3)= ~ 0.0746; 


Distributional Result (7) may be used to answer questions about the 
waiting times between serious earthquakes. The waiting time T in days 
between serious earthquakes has an exponential distribution with 
parameter À = 67/22120, so the c.d.f. of T is 


F(t) =] ert =p e7 (67/22120) xt. t>0. 


So, for instance, the probability of a gap of more than one year between 
serious earthquakes is 


P(T > 365) = 1 — F(365) =1— (1 2 ee Suen) 


= e 110556 ~ 0,3310. 





Japan Earthquake 11/03/2011 M8.9 
Recorded at Keele University, UK 





A seismometer records the moment the Japanese earthquake (and 
then tsunami) of 2011 occurred 


Activity 16 Earthquakes in a decade 


Suppose that the occurrences of serious earthquakes may be reasonably 
modelled as a Poisson process with rate \ = 67/22120 per day. 


(a) Write down the distribution of the number of serious earthquakes 
worldwide in a typical decade. (Allow for two leap years.) 
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(b) Find the probability that there will be exactly five serious earthquakes 
in a typical decade. 


(c) Find the probability that there will be at least five and at most seven 
serious earthquakes in a typical decade. 


(d) What is the probability of a gap of more than five years between 
serious earthquakes? 


Activity 17 Grounded aircraft 


Suppose that the occurrences of wind conditions severe enough to ground 
aircraft at an airport may be modelled as a Poisson process with rate 
A = 3 per year. 


(a) Find the probability that aircraft are grounded because of severe wind 
conditions twice or more in a typical year. 


(b) Find the probability that aircraft are grounded exactly once in a 
two-year period. 


(c) Find the probability of a gap of more than two years between occasions 
when aircraft are grounded because of severe wind conditions. 


Some more examples of calculations of probabilities of numbers of events 
and of waiting times between events are contained in Screencast 5.2. 


Screencast 5.2 Calculations associated with the Poisson process 


3.2 Is a Poisson process a good model? 


Many events appear to occur ‘at random’ in continuous time in such a way 
that it is impossible to forecast when the next event will happen. For 
example, lightning strikes, road accidents, pleasant surprises, air crashes 
and earthquakes are unpredictable. But does this mean that a Poisson 
process is a good model for such events? 


Given appropriate data, Distributional Results (6) and (7) suggest a 
possible approach to the problem of deciding whether a Poisson process is 
a reasonable model for the occurrences of unpredictable events. For a 
Poisson process, the number of events that occur in an interval of length t 
has a Poisson distribution. So if data are available on the number of events 
that occur in intervals of some fixed length, then these data can be 
compared with a Poisson distribution. Or, since the waiting times between 
successive events in a Poisson process have an exponential distribution, 
given data on waiting times, we could investigate whether an exponential 
model is a good fit for the data. Another possibility, given suitable data, is 
to check whether it is reasonable to assume that the rate at which events 
occur remains constant: this is a defining property of a Poisson process. 


3 The Poisson process 


These possible approaches are illustrated in the examples that follow and 
in Chapter 10 of Computer Book A. 





Example 13 Particle counting 


The physical nature of radioactive particle emission is such that emissions 
occur entirely at random, independently of one another, but at a rate 
which remains virtually constant with passing time. This suggests that a 
Poisson process ought to be a good model for emissions. 


In 1910, the scientists Ernest Rutherford and Hans Geiger reported an 
experiment in which they counted the number of alpha particles emitted 
from a radioactive source during intervals of length 7} seconds. Counts 
were obtained for 2612 intervals, 10 126 particles being recorded in total. 
The numbers of particles emitted and the frequencies with which the 
different counts occurred are shown in Table 7. (Source: Rutherford, E. 
and Geiger, H. (1910) ‘The probability variations in the distribution of : 
alpha particles’, Philosophical Magazine, Sixth Series, vol. 20, enol e 


pp. 698-704.) (left) in the physics laboratory 
If the Poisson process is a good model for the emission of alpha particles, of the Victoria University of 
then the data in Table 7 should be very like observations from a Poisson ee 
distribution. Recall that the mean and variance of a Poisson distribution 

with parameter u are both equal to u. So if the Poisson process is a good 

model, then the sample mean and sample variance should be similar. For 

these data, the sample mean is 3.877 and the sample variance is 3.696, so 

the values of the two statistics are indeed close. Moreover, if a Poisson 

distribution is fitted to these data, then an appropriate estimate of the 

parameter u is the sample mean, 3.877. 





In Figure 7 the relative frequencies of the counts and the probability mass Table 7 Emissions 
function of a Poisson distribution with parameter 3.877 are drawn side by of alpha particles 
side. As you can see, the Poisson distribution appears to fit the data very 


Count Frequency 
well. m 


57 
0.254 E Relative frequency 
E p(x) 


OONDOHRWNEH © 
a 
© 
oa) 











0 1 2 3 4 5 6 7 8 9 10 11 12 >12 0 
Count, x 


Figure 7 Relative frequencies of emissions and the p.m.f. of the 
Poisson(3.877) distribution 
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Table 8 Emissions and 


expected frequencies 


Count Frequency Expected 
frequency 


OOMONDOTBRWNHH © 





57 
203 
383 
525 


54 
210 


More earthquake damage 


See Equations (3) and (5). The 
mean and standard deviation 
are both equal to 1/A. 
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Table 8 contains the data from Table 7 and, in the third column, the 
frequencies that we would expect to obtain for a Poisson(3.877) 
distribution. These expected frequencies were calculated in the following 
way. For a Poisson(3.877) distribution, p(0) = e7877 ~ 0.0207, so of the 
2612 75-second intervals, we would expect 0.0207 x 2612 ~ 54 intervals to 
contain no emissions. Similarly, p(1) = 3.877e7877 ~ 0.0803, so we would 
expect 0.0803 x 2612 ~ 210 intervals to contain one emission. And so on. 
(The sum of the expected frequencies is 2611 rather than 2612; this 
discrepancy is due to rounding error.) You do not need to be able to 
calculate expected frequencies like these at this stage in the module. 


Because the observed frequencies and the expected frequencies are close, a 
Poisson(3.877) distribution seems to fit the data very well. In this respect, 
the data are consistent with a Poisson process being a good model for the 
emission of alpha particles. 





A word of caution is appropriate here. Note that if a Poisson distribution 
is a good model for the counts, this does not necessarily mean that a 
Poisson process is a good model for the occurrences of the events. 
Suppose, for instance, that all the low counts occurred early on during the 
period of observation, and all the high counts towards the end. This would 
suggest that the rate of occurrence of events was not constant but was 
increasing; and in that case, a Poisson process would not be a good model 
for the occurrences of the events. Given the data only in the form of 
Table 7, we cannot check this out further. 





Example 14 Serious earthquakes 


If a Poisson process is a good model for the occurrences of serious 
earthquakes around the world, then the waiting times between successive 
serious earthquakes will be observations from an exponential distribution 
with parameter A, where À is the rate of occurrence of serious earthquakes. 


In Section 2, data on waiting times between serious earthquakes were 
introduced. These data are repeated in Table 9 for convenience. 


Table 9 Time intervals between serious earthquakes (days) 


186 77 682 449 91 1027 164 808 83 832 
328 1120 743 328 163 83 64 356 324 257 
503 232 252 151 142 32 120 100 561 755 

44 200 47 503 320 690 386 147 533 109 
560 26 460 420 291 474 131 643 71 270 
115 240 204 34 493 423 422 219 366 92 
194 230 717 506 104 91 332 


An exponential model was proposed in Subsection 2.2. But is the model a 
good fit to the data? 


For an exponential distribution, the mean and standard deviation are 
equal. The sample mean and the sample standard deviation for these data 


3 The Poisson process 


are approximately 330 days and 255 days, respectively. These values are 
not too different, but there may be a little doubt that the data come from 
an exponential distribution. 


A unit-area histogram of the data is shown in Figure 8. As already noted 

in Subsection 2.2, these data are highly skew: they peak on the left of the 

histogram and the frequencies tail off to the right. This shape is 

reasonably consistent with the data coming from an exponential 

distribution. The p.d.f. of the exponential distribution with parameter Figure 8 is essentially Figures 1 
A = 67/22120 is also shown in Figure 8, superimposed on the histogram. and 4 combined. 
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Figure 8 A unit-area histogram of the earthquake data with the p.d.f. of 
an exponential model superimposed 


With slight misgivings, it looks as though the exponential model might be 
a reasonable one for these data. 


The data in Table 9 may also be used to obtain a visual check on whether 
or not the rate of occurrence of earthquakes remains constant over time. 
Reading across the rows, the time intervals are given in order of 
occurrence. So the first earthquake after the one on 15 August 1950 
occurred after 186 days; the second, after 263 days (186 + 77); the third, 
after 945 days (186 + 77 + 682); and so on. That is, 1 earthquake had 
occurred after 186 days, 2 had occurred after 263 days, 3 after 945 days, 
etc. In Figure 9(a) (overleaf), the time of each earthquake (that is, the 
cumulative time) is plotted along a line in the manner of Figure 6. The 
same information is relayed in possibly more useful form in Figure 9(b). 
There the number of earthquakes that had occurred since the start of the 
period of observation is plotted against the times at which the earthquakes 
occurred. If the earthquakes are occurring at a constant rate, the points in 
Figure 9(b) should approximately follow a straight line through the origin. 


The points in Figure 9(b) seem to lie quite close to a straight line through 
the origin, suggesting that over the period of observation the rate of 
occurrence of serious earthquakes remained roughly constant. So it would 
seem that a Poisson process might be a reasonable model for the 
occurrences of serious earthquakes. 
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Cumulative number of earthquakes 
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Figure 9 Times of occurrence of serious earthquakes plotted in two ways: 
(a) as points along a timeline, (b) as a scatterplot of cumulative number of 
earthquakes against their times of occurrence 
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The rest of the work in this subsection is contained in Computer Book A. 
For each of a number of datasets, you will investigate informally whether a 
Poisson process might be a good model for the occurrences of the events. 
In each case, data are given either on counts or on waiting times. The 
following box is intended as a reminder of the different equalities between 
means and dispersion parameters that we are looking for if Poisson and 
exponential distributions are good models for our data. 


Equalities between means and dispersion parameters 
For a Poisson(A) distribution, 

mean = variance = A; 
for an exponential distribution with parameter A, 


mean = standard deviation = 1/X. 
You will also use Minitab to calculate probabilities associated with events 
occurring in a Poisson process. 


Refer to Chapter 10 of Computer Book A for the rest of the 
work in this subsection. 


4 Population quantiles 


Exercises on Section 3 


Exercise 6 Explosive volcanic eruptions 


Data on the intervals between successive major explosive volcanic 
eruptions in the northern hemisphere were introduced in Exercise 3. 
Suppose that occurrences of such eruptions may be modelled as a Poisson 
process with rate A = 0.0352 per month. 


(a) Write down the distribution of the number of eruptions that occur in a 
year. 


(b) (i) Find the probability that exactly one eruption occurs in a year. 


(ii) Find the probability that there is more than one eruption in a 
year. 


(c) Write down the distribution of the waiting times in months between 
successive eruptions. 


(d) Find the probability that the gap between successive eruptions will 
exceed five years. 





Exercise 7 Telephone orders at a restaurant 


Suppose that the arrivals of telephone orders at a popular pizza outlet on a 
Saturday evening may be modelled as a Poisson process with rate 15 per 
hour. 


(a) Write down the distribution of the number of telephone orders 
received in ten minutes. 


(b) (i) Find the probability that exactly three orders are received in ten 
minutes. 





(ii) Find the probability that at most two orders are received in ten 
minutes. 


(c) Write down the distribution of the waiting times in hours between the 
arrivals of successive telephone orders. 


(d) Find the probability that the gap between two successive orders will 
exceed five minutes. 





4 Population quantiles 


To conclude this unit, we consider something a bit different. In Unit 4, the 
population analogues of the sample mean, sample variance and sample 
standard deviation were defined. In this section, the population analogues 
of the sample median and sample quartiles are defined and discussed. 
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Quantiles are also pretty much 
synonymous with ‘percentiles’ 


Recall that for continuous 
distributions, P(X = m) is 
effectively 0, as it is for any 
other value in the range of X. 


F(x) derives from f(x) in the 
usual way via f f(y) dy. 
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More generally, the idea of population quantiles is introduced: the 
population median and population quartiles are examples of population 
quantiles. This idea is important for many of the statistical techniques 
discussed later in the module. 


Quantiles of continuous distributions are discussed in Subsection 4.1 and 
quantiles of discrete distributions in Subsection 4.2. 


4.1 Quantiles of continuous distributions 


Before the idea of population quantiles in general is introduced, the 
population analogues of the sample median and sample quartiles will be 
discussed briefly. 


The population median 


In Unit 1, the sample median was defined to be the middle value of a set of 
values (or halfway between the two middle values when there is an even 
number of values). Roughly speaking, the sample median divides the data 
into two halves, one with values lower than the median and the other with 
values greater than the median. 


For a continuous random variable X, the population analogue of the 
sample median is the value m that splits the probability distribution of X 
into two halves in such a way that 

P(X <m)=5 and P(X >m) =F. 
Equivalently, if F(a) is the c.d.f. of X, then 

Fim) =1=—F(m) =i. 


Thus we have the following definition. 


The population median 


For a continuous random variable X with c.d.f. F(x), the population 
median, or simply the median, is the value, m, which satisfies 


Notice that, for continuous population distributions, there is no ambiguity 
in the definition of the (population) median: there is no analogue of the 
need to treat samples of odd and even sizes separately. 





Example 15 Finding the median 

Suppose that a random variable X has p.d.f. 
f(z) =3a7, O<2<1, 

and c.d.f. 
F(x) = P(X <a2)=23, 0<2<1. 
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To find the median of this distribution, we need to find the value m such 


that P(X < 0.794) =} 


That is, we need to solve the equation 
m= 2, 
which gives 


= (4)? ~ 0.794. 








0 
0 0.25 0.5 0.75 1 
"(= 94) 


Figure 10 The median of X 


The median is shown on a plot of the p.d.f. of this distribution in Figure 10. 





Activity 18 The median of a continuous uniform distribution 


The continuous random variable X is uniformly distributed over the 
interval a < x <b: X ~ U(a,b). This distribution has c.d-f. 


T—a 
F(z) = b—a’ 


(a) What is the median of X? 





a<ax<b. 


(b) How does the median compare with the mean u = (a + b)/2? 


The result in part (b) of Activity 18 is a consequence of the symmetry of 
the uniform distribution. The ‘centre of symmetry’ is then both the 
median and the mean of the distribution. For distributions that are not 
symmetric, means and medians differ. The exponential distribution is a 
good example, as you will investigate next. 


Activity 19 The median of an exponential distribution 
A random variable X is exponentially distributed with parameter A: 
X ~ M(A). This distribution has c.d.f. 
F(x) =1-e-**, «>0. 
(a) Find the median of X. 
(b) Express the median of X as a multiple of the mean of X. 


Activity 20 Waiting times between earthquakes 


The sample median waiting time between successive earthquakes for the 
data given in Table 6 is approximately 257 days. In Example 7 an 
exponential distribution with mean 330 days was suggested as a model for 
the waiting time between serious earthquakes. According to this model, 
what is the (population) median waiting time between serious 
earthquakes? How does this compare with the sample median? 
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Population quartiles 


The idea behind the definition of sample quartiles is to choose values 
which split the data into proportions of one-quarter and three-quarters. 
There are, of course, two ways to do this. Consequently, as you saw in 
Unit 1, there are two quartiles: the values of approximately a quarter of 
the data are smaller than the lower quartile, and three-quarters are larger; 
and the values of approximately three-quarters of the data are smaller 
than the upper quartile, and a quarter are larger. The difference between 
the upper and lower quartiles gives the sample interquartile range. 


To define population quartiles for continuous distributions, ‘proportions’ 
are replaced by ‘probabilities’ in the following way. The lower population 
quartile of a continuous random variable X, denoted by qz, satisfies 


P(X < qr) = F(q) = 3, 
and the upper population quartile, denoted by quy, satisfies 
P(X < qu) = F (qu) = 2. 


So population quartiles of continuous distributions are defined as follows. 


Population quartiles 


For a continuous random variable X with c.d.f. F(x), the lower 
quartile is the value, qz, which satisfies 


F(q) =7- 
The upper quartile is the value, qu, which satisfies 
F(qu) = $. 


The interquartile range is qy — qr. 


Notice that the population interquartile range has also been defined in the 
above box, in terms of the population quartiles in the obvious way. Just as 
the sample interquartile range is a measure of the spread or dispersion in a 
sample of data, so the population interquartile range is a measure of the 
spread or dispersion in a population model. 





Example 16 Finding the lower quartile 

The c.d.f. of the random variable X of Example 15 is 
F(x)=z3, 0<z<1. 

The lower quartile, qz, is the solution of F(x) = +: That is, 
F(at) = (az)? = 3, 


and hence 





Activity 21 Finding the upper quartile and the interquartile range 


(a) Find the upper quartile gy for the random variable X of Example 16. 
(b) Hence find the interquartile range of X. 


Activity 22 Quartiles of an exponential distribution 


A random variable X is exponentially distributed with parameter A: 
X ~ M(d). 


(a) Find the lower quartile and the upper quartile of X. 
(b) Hence find the interquartile range of X. 


(c) The (sample) lower quartile of the waiting times between successive 
serious earthquakes for the data given in Table 6 is 115 days and the 
(sample) upper quartile is 493 days, so the (sample) interquartile 
range is 493 — 115 = 378 days. In Example 7, an exponential 
distribution with mean 330 days was suggested as a model for the 
waiting time between serious earthquakes. According to this model, 
what are the lower quartile, the upper quartile and the interquartile 
range of the waiting time between serious earthquakes? How do these 
compare with the values calculated from the data? 


Quantiles 


Finding a population median involves solving the equation F(x) = 5: The Qua iles 
3 


quartiles, qz and qu, are found by solving F(x) = ; and F(x) = a, 


respectively. So each of the median, the lower quartile and the upper 
quartile is the solution of an equation of the form F(x) = a, where 
0<a<1. More generally, the solution of an equation of this form is 
called a population quantile. The definition is as follows. 


Quantiles of continuous distributions 


For a continuous random variable X with c.d.f. F(x), the a-quantile 
is the value x which is the solution of the equation 


Ea aor << a <a, 
This value is denoted qa. Put another way, qa is the value such that 


F (qa) = P(X S qa) = a. 


Notice that a is a value between 0 and 1 that labels which quantile you 
require, while the a-quantile itself, qa, is a value in the range of the 
distribution. 


4 Population quantiles 
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The idea of a population quantile is illustrated in Figure 11. Each diagram 
shows the graph of a typical continuous c.d.f. F(x). For a given value a, 
say, on the horizontal axis, you could evaluate (or read off the graph) the 
corresponding value F(a) on the vertical axis. (F(a) = P(X <a) isa 
probability, so 0 < F(a) < 1.) This is shown in Figure 11(a). 

















Figure 11 Illustrating population quantiles 


In Figure 11(b), the starting point is a number a (0 < a < 1) on the 
vertical axis. The corresponding value on the horizontal axis is the 
a-quantile of X, the value qa. That is, the a-quantile qa is the value qa 
which satisfies F (qa) = a. (Mathematically, what we are doing is inverting 
the cumulative distribution function. We could therefore write 

da = F~'(a), where F~! is the inverse function of F. But we won’t 
usually do so explicitly in this module.) 











da 
£ 


Figure 12 Illustrating a population quantile on the p.d.f. 


By definition, F(x) = P(X < zx), so 
F(qa) = P(X < qa) = a. 


So the definition of the a-quantile, qa, says that the values of a proportion 
a of the population are less than or equal to the a-quantile. This can also 
be illustrated on the p.d.f. (as opposed to the c.d.f.) as shown in Figure 12. 
Note that Figures 11(a), 11(b) and 12 are different ways of showing the 
same thing. In particular, as you already know, half the values are below 
or equal to the median m, so m = qo.5; and a quarter of the values are 
below or equal to the lower quartile gz, so gz = qo.25. Similarly, 

F (qu) = 0.75, so qu = qo.75. The median and quartiles are illustrated on a 
c.d.f. in Figure 13. 




















Figure 13 The median and quartiles 


The number a is often expressed as a percentage. When this is the case, 
the quantile is often called a percentile or percentage point; these terms 
are synonymous with quantile. So, for example, the 0.3-quantile is also the 
30th percentile, and the 95% percentage point is also the 0.95-quantile. 
Also, if œ is an integer multiple of 0.1, the corresponding quantiles are 
sometimes called deciles: that is, the deciles are go.1, 90.2,--- 490.9: 


The median and quartiles have been generalised to population a-quantiles 
because many statistical techniques depend on the idea of quantiles. In 
particular, for some of the methods introduced later in the module, the 
‘extremes’ or ‘tails’ of a distribution are important. So quantiles associated 
with values such as 0.9, 0.95, 0.99, at one extreme, and 0.1, 0.05, 0.01, at 
the other, are often used. 


You can get more of a feel for the definition and meaning of quantiles of 
continuous distributions using the computer animation provided in 
Chapter 11 of Computer Book A. 


Refer to Chapter 11 of Computer Book A to explore quantiles 
further. 


Example 17 Finding quantiles mathematically 

The c.d.f. of the random variable X of Example 15 is 
F(2)=2°, 0<2<1. 

So the 0.1-quantile, go.1, is the solution of F(x) = 0.1. That is, 
F(qo1) = (90.1)? = 0.1, 

and hence 


qo.1 = (0.1)! ~ 0.464. 


4 Population quantiles 


goongna 


DOC fj ide 





‘Almost half of Britain’s private 
wealth owned by top 10% of 
households’, headline in The 
Guardian, 18 December 2015. 
The ‘top 10%’ of people are 
those whose wealth exceeds the 
qo.9 decile, or equivalently the 
0.9-quantile or 90th percentile. 


QJ 
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There are also cases where the 
c.d.f. formula is available but it 
cannot be explicitly inverted. 
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Similarly, go.9 is the solution of F(x) = 0.9. So 
(q0.9)° = 0.9, 


and hence 


qo.9 = (0.9)! ~ 0.965. 





Activity 23 Finding more quantiles 


The c.d.f. of the random variable Z is 
F@=2, 0<z<1. 


Find the quantiles go.95 and qo.99 for Z. 


Activity 24 Quantiles for an exponential distribution 


A random variable X has an exponential distribution with parameter A: 
X~ M(X). 


(a) Find the quantiles go.o1 and qo.s of X. 


(b) Following Figure 3, make a very rough sketch of the c.d.f. of the 
exponential distribution with parameter A, and on it mark a = 0.8 
and d0.8- 


(c) Following Figure 18 in the solution to Activity 19, make a very rough 
sketch of the p.d.f. of the exponential distribution with parameter A, 
and on it mark the mean and qo.s. 


In this subsection, you have seen how quantiles for a continuous random 
variable are calculated using the c.d.f. F(x). However, for some continuous 
distributions, a simple formula for the c.d.f. is not available. In such cases, 
printed tables or computer software must be used to find quantiles. 


4.2 Quantiles of discrete distributions 


Quantiles of continuous distributions are found by solving the equation 
F(a) =a for x. In all the examples you have met, and in all the cases you 
will encounter in this module, this equation has a unique solution. 


Unfortunately, when quantiles of discrete distributions are discussed, this 
simplicity and lack of ambiguity disappears. Quantiles of discrete 
distributions are not as important as those of continuous distributions, so 
they will be discussed only very briefly here. In this subsection, we will 
look at some of the problems and how they might be overcome. 


Example 18 The median score on a fair die 


We begin with the example of a uniform distribution on the integers 
1,2,...,6. This is the model adopted for X, the score obtained when a fair 
six-sided die is rolled. The c.d.f. F(x) of X is shown in Table 10. 


Table 10 The c.d.f. of the score on a fair die 


If we solve the equation F(x) = $ to find the median, then this yields 
m = 3. 


However, when the population analogue of a sample median was 
introduced at the beginning of Subsection 4.1, the underlying idea was to 
find a value m which splits the distribution of X into two halves in such a 
way that P(X < m) = 4 and P(X > m) = 35. For the random variable X 
in this example, P(X < 3) = 5 and P(X > 4) = $. So, which of the 
values 3 and 4 should we take as the median of X? If the median is 
regarded as an indication of the ‘centre’ of the distribution, then neither 
value is satisfactory. So perhaps we should use neither value: any 
(non-integer) value x strictly between 3 and 4 does satisfy P(X < x) = 4 
and P(X > x) = 5: Should we take one of these values? Considerations of 
symmetry suggest 3.5, which is at the centre of the distribution (and is 
also the mean of the distribution). But, as you will soon see, this is not the 
choice usually made. 





Example 19 The median of a uniform distribution 


Another problem is illustrated by considering a random variable Y which 
is uniformly distributed on the integers 1,2,...,5. The c.d.f. is given in 
Table 11. 


Table 11 The c.d.f. of a uniform distribution 
5 
1 


In this case, if we try to solve the equation F(y) = 5, then we find that the 
equation is not satisfied for any value in the range of Y. (In fact, it is not 
satisfied for any value y, such as 2.5, between values in the range of Y 
either.) Considerations of symmetry suggest m = 3, which is at the centre 
of the distribution (and is also the mean of the distribution). But is this 
satisfactory since F'(3) = 2? 





These examples indicate that the definition of the median needs to be 
modified for discrete distributions. One possibility is to define the 
population median to be the minimum value m such that F(m) > 4. For 
the uniform distribution on the integers 1,2,...,5 of Example 19, this 
yields the solution m = 3, which is in the middle of the range. However, 


4 Population quantiles 


If a sample turned out to take 
the values 1, 2, 3, 4, 5, 6, then 
3.5 would be the sample median. 
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‘Perverse’ and ‘arbitrary’ might 
also come to mind, but this is a 
fairly standard definition of the 
population median of a discrete 
distribution 
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this definition also yields the value m = 3 for the distribution of 

Example 18, and 3 is not the ‘centre’ of that distribution. This is not 
entirely satisfactory. Nevertheless, this is the definition that is used in this 
module. 


Problems similar to those discussed above arise when finding other 
quantiles for discrete distributions. The definition of quantiles is modified 
in a similar way. The definition used in this module, which includes the 
definition of the median, is therefore as follows. 


Quantiles of discrete distributions 


For a discrete random variable X with c.d.f. F(x), the a-quantile qa 
is defined to be the smallest value of x in the range of X satisfying 


EMG) Z C 





Example 20 Quartiles of a binomial distribution 


Table 12 contains the cumulative distribution function F(x) of a random 
variable X having a binomial distribution with parameters n = 6 
and p = 0.6: X ~ B(6,0.6). 


Table 12 The c.d-f. of a binomial distribution, B(6, 0.6) 


x 0 1 2 3 4 5 6 
F(x) 0.0041 0.0410 0.1792 0.4557 0.7667 0.9533 1 


To find the median of X, we need to find the smallest value of x in the 
range of X such that F(x) > 0.5. From the table, F(3) = 0.4557, which is 
less than 0.5, and F'(4) = 0.7667, which is greater than 0.5, so 


F(3) < 0.5 < F(4). 
Hence the median nm is 4. 


The lower quartile gz is the smallest value of x in the range of X for which 
F(x) > 0.25. Since F(2) < 0.25 < F(3), the lower quartile qz is 3. 
Similarly, since F(3) < 0.75 < F(4), the upper quartile qy is 4. 


Notice that in this example, the median and the upper quartile are equal. 





Activity 25 Quantiles of a binomial distribution 


Use the values of the c.d.f. given in Table 12 to find the quantiles qo.01, 
go.05; Go. and qo.95 for the binomial random variable X of Example 20: 
X ~ B(6,0.6). 


The final work in this subsection consists of another short chapter of 
Computer Book A. This chapter explains how to use Minitab to find 
quantiles for both continuous and discrete distributions. 


Refer to Chapter 12 of Computer Book A for the rest of the 
work in this subsection. 


Exercises on Section 4 





Exercise 8 Quartiles of a continuous uniform distribution 
The random variable X is uniformly distributed over the interval 
a<x<b: X ~U(a,b). It has c.d.f. 


F(z) = ZE, ae oe a, 


Find the lower quartile, upper quartile and interquartile range of X. 





Exercise 9 Finding quantiles of another continuous distribution 
The c.d.f. of the continuous random variable Y is given by 

F(y) = ty’, O<y<2. 
(a) Find the median, the quartiles and the interquartile range of Y. 


(b) The c.d.f. of interest in this question is shown in Figure 14. 


14 








0 T 
0 1 2 


y 
Figure 14 The c.d.f. F(y) = 4y? 


On a copy of this figure, sketch the positions of the median and 
quartiles, and the corresponding values of a. 


Exercise 10 Finding quantiles of yet another continuous distribution 
The c.d.f. of the continuous random variable Z is given by 
F(z)=2łf, 0<z<1, 
with 6 > 0. 
(a) Find a general formula for the a-quantile of Z. 
(b) Find the median, the lower quartile and the 0.8-quantile of Z. 


(c) Hence give the 0.8-quantile for the particular choice 8 = Z. 


4 Population quantiles 


QJ 
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Exercise 11 Quartiles of a discrete uniform distribution 
The random variable X is uniformly distributed on 1,2,...,10. 


(a) Draw up a table showing values of the c.d.f. F(x) of X for 
£=1,2,...,10. 


(b) Find the lower quartile, the median and the upper quartile of X. 


Summary 


In this unit, you first saw, numerically, how the Poisson distribution with 
parameter u = np provides a useful approximation to the binomial 
distribution B(n, p) for large values of n and small values of p. You also 
met the exponential distribution, denoted M (A), a continuous distribution 
with (rate) parameter \ > 0. The exponential distribution has mean 1/A 
and variance 1/)?. 


Two models for events that occur at random as time progresses have been 
discussed. The Bernoulli process is a model for events that may occur only 
at a discrete set of opportunities, or trials, in such a way that the 
probability that an event occurs at each trial is constant and is 
independent of the outcomes of previous trials. The Poisson process is the 
continuous-time analogue of the Bernoulli process: events occur singly at a 
constant rate, and the incidence of future events is independent of the past. 


The parallels between the Bernoulli and Poisson processes are summarised 
in Table 13. 


Table 13 The parallels between the Bernoulli and Poisson processes 


Process Type Distribution of Distribution of waiting 
number of events time between events 


Bernoulli discrete Binomial Geometric 
Poisson continuous Poisson Exponential 


The population analogues of the sample median and sample quartiles have 
also been discussed, and the idea generalised to population quantiles. 


Learning outcomes 


After you have worked through this unit, you should be able to: 


understand and use the fact that the Poisson distribution with 
parameter = np provides a useful approximation to the binomial 
distribution B(n, p) for large values of n and small values of p 


use the fact that the parameter A of an exponential distribution may 
be estimated from data by the reciprocal of the sample mean 


recognise that the number of events in a Poisson process with rate A 
per unit time that occur in an interval of length t has a Poisson 
distribution with parameter At 


recognise that the waiting time between successive events in a Poisson 
process with rate A has an exponential distribution with parameter A 


use the Poisson distribution and the exponential distribution to 
calculate probabilities associated with events occurring in a Poisson 
process, both mathematically and by using Minitab 


explore data to investigate whether a Poisson process is a plausible 
model for the occurrences of events 


appreciate that, in the continuous case, the a-quantile, qa, solves 
F (qa) =a,0<a< 1, where F is the cumulative distribution function 


appreciate that, in the discrete case, the a-quantile, qa, is the smallest 
value of x in the range of X satisfying F(x) >a,0<a<1 


calculate population a-quantiles, both mathematically and by using 
Minitab 
remember that the median, m, is the 0.5-quartile, the lower quartile, 


qL, is the 0.25-quartile, and the upper quartile, gy, is the 0.75-quartile, 
and that qu — qz is the interquartile range. 


Learning outcomes 
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Solutions to activities 


Solution to Activity 1 


(a) Assuming 50 journeys a month and using the given average of 15 
drenchings a year, the proportion of journeys that result in a 
drenching is 

15 i651 


12x50 600 40 


(b) The number of times a month that Chris gets drenched, X, has a 
binomial distribution: 


X ~ B(50, 1/40). 


p= 


(c) Assuming the binomial model, the mean number of drenchings a 
month is 


i 
E(X)= = 50 x — = 1.25. 
(X)=nxp X 75 


(d) (i) The probability that Chris does not get drenched at all in a 
month is 


50 0 50 50 
P(X =0) = @ (4) (#2) = (8) ~ 0.2820. 
(ii) The probability that Chris gets drenched twice is 


P(X =2) = e Gej = 1225 (4)? C) ~ 0.2271. 


Solution to Activity 2 
If X ~ B(60, 1/48), then 


P(X =0) = (42) ~ 0.2897. 


To obtain this result, full calculator accuracy was retained for 47/48. 
Rounding 47/48 to 0.979, for instance, would give (0.979)°° ~ 0.2799, and 
so would introduce quite serious rounding errors. In calculations like 
these, it is important not to round intermediate values. 


Also, if X ~ B(60, 1/48), then 


P(X =2)= (2) (4)? (E) = 1770 (4)? (2) ~ 0.2266. 
The completed table is as follows. 
Table 14 

P(X =0) P(X=1) P(X =2) P(X =3) P(X>4) 
B(50, 1/40) 0.2820 0.3615 0.2271 0.0932 0.0362 
B(60, 1/48) 0.2827 0.3610 0.2266 0.0932 0.0365 


Notice that the two sets of probabilities do not differ greatly: all the 
differences between values are less than 0.001 in absolute value. 


Solution to Activity 3 


(a) The number of faulty resistors in a box, X, has a binomial 
distribution: X ~ B(50,1/20) or X ~ B(50,0.05). So 
P(X =1)= a (0.05) (0.95)*? ~ 0.2025, 


P(X =3)= 5) (0.05)? (0.95)*" ~ 0.2199. 


(b) Since the mean of X is 50 x 0.05 = 2.5, the approximating probability 
distribution is Poisson(2.5). So 


-2.52.5 

P(X =1)~% —— ~ 0.2052, 
Sa oy 

P(X =3)~ = ~ 0.2138. 


(c) The probabilities are ‘similar’, but are not really very close — certainly 
not as close as in some previous activities and examples. The 
parameter p is 0.05, which is at the limit of the range of values given 
in the rough rule for when a Poisson approximation will be a good 
one. Also, n is not as large as it has been in some previous examples. 


Solution to Activity 4 


X, the number of passengers who have booked a seat but do not turn up, 
may be modelled by a binomial distribution: X ~ B(200,0.01). Since p is 
small and n is large, this may be approximated by a Poisson distribution 
with mean u = 200 x 0.01 = 2. 


Everyone who turns up will get a seat if at least two people fail to turn up. 
Hence the required probability is 


P(X > 2) =1-— (P(X =0)4+ P(X =1)) ~ 1 — (e7? + 2e7) ~ 0.594. 


Solution to Activity 5 


In realisation (b), the observed values of X are 4, 3, 1, 1, 6, 2, 1, 3 and 1. 
The next value of X must be 4 or more. 


In realisation (c), the observed values of X are 2, 2, 1, 2, 5, 5, 1 and 2. 
The next value of X must be 6 or more. 
Solution to Activity 6 


The number of trials from one success to the next, X, has a geometric 
distribution with parameter p = 0.3. 


(a) P(X =x) = (0.7)77t x 0.3, z= 1,2,..., 
so we have the following. 
(i) P(X = 2) = 0.7 x 0.3 = 0.21. 
(ii) P(X = 6) = (0.7) x 0.3 ~ 0.050. 
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(b) PX Seale. =] e 
SO 
P(X > 3)=1-— P(X < 2) = 1 — (1 — (0.7)?°) = 0.49. 
Alternatively, 
P(X > 3) =1- (P(X =1)+ P(X =2)) 
= 1 — 0.3 — 0.21 = 0.49. 


Solution to Activity 7 


(a) For a fair six-sided die, p = 1/6. The p.m.f. of X, the number of rolls 
from one six to the next, is therefore given by 


plx) = ae ee, CSL: 


P(X =4) = (8)° x 4 ~ 0.096. 


(b) The c.d.f. of X is given by 
F(x) = P(X <2) =1- ($), PEL eon 
So 
534 
P(X >5)=1- P(X <4)=1-(1-() ) = 0.482. 


Solution to Activity 8 


(a) Two assumptions are made. First, whether or not a serious 
earthquake occurs on any particular day is independent of whether or 
not a serious earthquake occurs on any other day. Second, the 
probability that a serious earthquake occurs on any single day remains 
constant from day to day. 


(b) The time between successive serious earthquakes has a geometric 
distribution with parameter p, where p is the probability of a serious 
earthquake on any single day. 


(c) The histogram is highly skew, with the largest frequencies for low 
values and with frequencies tailing off towards the right. So a 
geometric model might be a reasonable one. 


(d) We could calculate the mean time between serious earthquakes for the 
given data. Then, since the mean p of a geometric distribution is 
equal to 1/p, we could estimate p by the reciprocal of the sample 
mean. (This is equivalent to dividing the number of serious 
earthquakes after 15 August 1950, which is 67, by the total number of 
days, thus calculating the proportion of days on which a serious 
earthquake occurred.) 


Solution to Activity 9 


A time interval of two years, with no leap year, corresponds to 730 days 
altogether. The probability of a gap exceeding 730 days is 


67 730 
P(X =1]1— F =1 1 1 
(X > 730) (730) ( ( z =) 


730 
= (1 = y] ~ 0.1092, 


or a little above 1 in 10. Since 0.1092 x 67 ~ 7.3, we might expect around 
seven intervals in a list of 67 to exceed 730 days. In fact, in Table 6 there 
are six such gaps; they are of lengths 1027, 808, 832, 1120, 743 and 

755 days. 





Solution to Activity 10 


(a) [eda Dreme +e 








Solution to Activity 11 


The range of the exponential distribution has lower limit 0. So, taking f 
as in Equation (1), 


F(z) = [ twa [oa 


eTòY 
[5 


This is Equation (2), as required. 





| = — (e> — e’) =1- e7”, 
0 


Solution to Activity 12 
(a) (i) 











0 T T T 
0 200 400 600 [800 1000 1200 


z 730 


Figure 15 Area corresponding to P(X > 730) on Figure 4 
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(ii) The probability that the interval between successive serious 
earthquakes will exceed two years is 
P(X S730) = 1 -F(730) =1- (1 a ee) 
œ e7?21112 ~ 0,1096. 


This is very close to 0.1092, the value found in Activity 9 using a 
geometric model. 


(b) The probability that the gap between successive earthquakes will be a 
week (7 days) or less is 


P(X < 7) = F(T) = 1 — e7 (67/22120)xT ~ ] — ẹ70-02120 ~ 0,0210. 
(c) The proportion of intervals that exceed the mean is 
P(X > 22120/67) = 1 — F(22120/67) 
—1— (1 _ gC een) 


=e! ~ 0.3679. 
So according to the model, a little over a third of gaps are longer than 
average. 


For the data in Table 6, 27 out of 67, or approximately 0.4030, of 
intervals between serious earthquakes exceed 330 days. The two 
proportions are quite close, although the observed proportion is the 
larger. 


Solution to Activity 13 
(a) 





P(365 < X < 730) 













800 1000 1200 


365 730 


Figure 16 Area corresponding to P(365 < X < 730) on Figure 4 





(b) The probability that the interval between successive serious 
earthquakes will be between one and two years is 
P(365 < X < 730) = e7 (67/22120)x365 _ ~ (67/22120)x730 
~ e™1-10556 _ .-2.21112 ~ 0,2214. 
(c) The probability that the interval between successive serious 
earthquakes will be between a week and a month is 
P(T < X < 30) = e7 (67/22120)x7 _ ¢(67/22120) x30 
~ e~ 0-09087 _ „—0.02120 ~ 0.0659. 
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Solution to Activity 14 


If the waiting time between successive events is represented by an 
exponential random variable X with parameter A, then the mean waiting 
time is 1/A. So the proportion of waiting times that are longer than the 
mean waiting time is given by 


Px siei- Asi- (1 = eu) = e7! ~ 0.3679. 


This proportion is independent of the value of the parameter A. (You saw 
this value arise in the solution to Activity 12(c) when A = 67/22120, but 
you have now shown that it is the correct proportion whatever the value 
of À.) 

Solution to Activity 15 


In each case X, the number of events in an interval, has a Poisson 
distribution with parameter At, where A is the rate at which events occur 
per unit time and t is the length of the interval. 


(a) In this case, A = 3 per minute. 
(i) The length of the interval is 5 minutes, so 
At = (3 per minute) x (5 minutes) = 15. 
Therefore X ~ Poisson(15). 
(ii) For an interval of a quarter of an hour, 
At = (3 per minute) x (15 minutes) = 45. 
So X ~ Poisson(45). 
(b) In this case, A = 5 per hour. 
(i) For an interval of length 34 hours, 
At = (5 per hour) x (35 hours) = 17.5. 
So X ~ Poisson(17.5). 
(ii) For an interval of 15 minutes, 
At = (5 per hour) x (4 hour) = 1.25. 
So X ~ Poisson(1.25). 


Solution to Activity 16 


(a) Allowing for two leap years, there are 3652 days in a decade. So X, 
the number of serious earthquakes in a typical decade, has a Poisson 
distribution with parameter 





67 
At = (= 720 Pet dav) x (3652 days) ~ 11.06167. 


(b) The probability that there will be exactly five serious earthquakes in a 
decade is 


=11:06167 (11.06167) 
P(X =5)= cee) ~ 0.0217. 
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(c) The probability that there will be at least five and at most seven 
serious earthquakes in a decade is 


P(5< X < T= P(X =5) + P(X =6) 4+ P(X =7) 
e™11-06167(11,06167)5 
5! 
r (1 4 11.06167 J eee 
6 7x6 
~ 0.1248. 


(d) The time T between serious earthquakes has an exponential 
distribution with parameter À = 67/22120, so the probability of a gap 
of more than five years, or 3652/2 = 1826 days, is 


P(T > 1826) =1- (1 = aa ~ 0.0040. 


Solution to Activity 17 


(a) If X is the number of groundings in a typical year, then X has a 
Poisson distribution with parameter 


At = (3 per year) x (1 year) = 3. 
So 
P(X > 2) =1- (P(X =0)4+ P(X =1)) 
= 1 — e™?(1 + 3) ~ 0.8009. 


(b) If Y is the number of groundings in a two-year period, then Y has a 
Poisson distribution with parameter 


At = (3 per year) x (2 years) = 6. 
So 
P(Y = 1) =6e~® ~ 0.0149. 


(c) The time T between groundings has an exponential distribution with 
parameter \ = 3, so the probability of a gap of more than two years 
between groundings is 


P(T > 2) =1-— F(2) = 1 — (1 — e™?*?) =e~® ~ 0.0025, 
or about 1 in 400. 


Solution to Activity 18 
(a) We need to find m such that 
m-a 1 
Be a 
Multiplying both sides by b — a, then adding a to both sides, yields 
m=a+t4(b—a)=$(b+a). 
(b) The median is equal to the mean. 


Both the mean and the median are midway between a and b, as shown 
in Figure 17. 











Figure 17 The p.d.f., mean and median of X ~ U(a,b) 


Solution to Activity 19 


(a) 


The median m satisfies the equation 
Fimis e "s, 


So 


or 
1 
—Am = log 5 = — log 2 


and hence 
_ log2 _ 0.693 


A À 
Here, as throughout this module, the notation log is used to refer to 


the ‘natural’ logarithm (the inverse of the exponential function). 
The mean of X is u = 1/A, so 
m = log 2 x u ~ 0.6934. 


So for an exponential random variable, the median is approximately 
70% of the mean. This result, which is illustrated in Figure 18 
(overleaf), is due to the considerable skewness of the exponential 
distribution. 
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Figure 18 The mean and median of an exponential distribution 


Solution to Activity 20 


From the solution to Activity 19, the median waiting time m, using the 
exponential model, is 


m = log2 x u = log 2 x 330 ~ 229 days. 
This is a bit less than the sample median of 257 days. 
Solution to Activity 21 
(a) The upper quartile, qu, is the solution of F(x) = 3. That is, 
F(qu) = (qu)? = 3, 
and hence 
1/3 
qu = (3)'" ~ 0.909. 
(b) The interquartile range of X is 
1/3 1/3 
jeg 2) GZ) 2007. 
Solution to Activity 22 
(a) The c.d.f. of X is 
F(z2)=1-e**, «>0. 
So the lower quartile gz satisfies 


1 
F(q) =1—eO" = -; 


4’ 
that is, 
3 
=A — 2 
€ A’ 
or 
3 
—Aqr = log 7 


sO 


1, 3 0.288 
ee 


The upper quartile qu satisfies 

3 

F(qu) =1-e°" = T 
that is, 


= 1 
equ =<, 


sO 


__1,,,1 ~ 1386 
qu =-5 87% 


(b) The interquartile range of X is 


À 


(Notice that this is slightly larger than the standard deviation of an 
exponential distribution, which is equal to 1/A.) 

If we use the results from parts (a) and (b) with 1/A = 330, then, 
according to the model, the quartiles (in days) are 


gr ~ 0.288 x 330 ~ 95, qu = 1.386 x 330 ~ 457. 


1 3 1 1.099 
= ~ | log- — log - ] x ——. 


So the interquartile range is approximately 457 — 95 = 362 days. The 
lower quartile given by the model is a bit smaller than the sample 
lower quartile, and the upper quartile for the model is also a bit 
smaller than the sample upper quartile. The population and sample 
interquartile ranges turn out to be similar to one another. 


Solution to Activity 23 
The 0.05-quantile, qo.o5, satisfies F'(qo.05) = 0.05. That is, 


F(qo.05) = (qo.05)* = 0.05, 


and hence 


qo.o5 = (0.05)1/4 ~ 0.473. 


The 0.99-quantile, go.99, satisfies F'(qo.99) = 0.99. That is, 


an. 


F(qo.99) = (qo.99)* = 0.99, 


d hence 


qo.99 = (0.99)*/4 ~ 0.997. 
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Solution to Activity 24 


(a) The 0.01-quantile, go.91, of an exponential random variable with 
parameter A satisfies 


F(qo.01) = 1—e7*?-™ = 0.01. 


So 

e7ò10.01 = 0,99, 
leading to 

—\qo.01 = log 0.99, 
and hence 


1 0.010 
= —— log 0.99 ~ ——. 
qo.01 3 og 5 


The 0.8-quantile, go.g, satisfies 
F(qo.8) = 1—e-*%* = 0.8. 
So 
e7ò%0.s — 0.2, 


and hence 


1 1.609 
= —— log 0.2 ~ ——. 
qo.8 X og X 


(b) The c.d.f. of X, following Figure 3, is shown in Figure 19; a = 0.8 is 
marked, as is the horizontal dashed line connecting it to the graph 
of F, as well as the vertical dashed line connecting F'(qo.g) to 
d0.8 = 1.609/X. 


1-4 
0.84 

















(c) The p.d.f. of X, following Figure 18, is shown in Figure 20; the 
0.8-quantile go.3 = 1.609/A is marked at the correct multiple of the 
mean 1/2. 


Figure 19 The c.d.f. of X ~ M(A) 
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Figure 20 The p.d.f. of X ~ M(A) 


Solution to Activity 25 
From Table 12, we have 
F(0) = 0.0041 < 0.01 < 0.0410 = F(1), 
so 
F(0) < 0.01 < F(1), 
and hence qo.01 = 1. 
Similarly, F'(1) < 0.05 < F(2), so go.05 = 2; 
F(4) < 0.8 < F(5), so qo.8 = 5; 
and F(4) < 0.95 < F(5), so qo.95 = 5 also. 
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Solutions to exercises 


Solution to Exercise 1 


(a) 


(b) 


(c) 


Since the mean number of errors on a page is 3.6, an appropriate 
approximate model for the number of errors on a page is a Poisson 
distribution with parameter 3.6. 


The Poisson probabilities are given by the formula 
P(X = x) = e~*°(3.6)”/z!. They are given alongside the binomial 
probabilities from Table 4 in the table below. 


Table 15 Probabilities of errors 
P(X =0) P(X=1) P(X =2) P(X =3) P(X>4) 


B(320,0.01125) 0.0268 0.0975 0.1769 0.2134 0.4854 
B(360, 0.01) 0.0268 0.0976 0.1769 0.2133 0.4854 
Poisson(3.6) 0.0273 0.0984 0.1771 0.2125 0.4847 


The probabilities using the Poisson model agree with the binomial 
probabilities to three decimal places. The approximation is really 
quite good. 


Solution to Exercise 2 


(a) 


(b) 


The number of people with tritanopia in a group of 100 people has a 
binomial distribution: X ~ B(100, 0.0001), so 


PO =i a) (0.0001) (0.9999)°° ~ 0.0099. 


The mean number of people with tritanopia in a group of 100 people 
is 100 x 0.0001 = 0.01, so, approximately, X ~ Poisson(0.01), and 
—0.01 
0.01 
P(X =1) > — ~ 0.0099. 
The Poisson approximation gives the same value as that given by the 
binomial model to four decimal places. 


Solution to Exercise 3 


(a) 


The shape of the histogram is not inconsistent with the data coming 
from an exponential distribution. In addition, the sample mean and 
sample standard deviation are fairly close in value. So an exponential 
model might well be suitable. 


The interval between two successive eruptions need not be a whole 
number of months; it could be any length. So a continuous model is 
more appropriate here. For this reason, an exponential model is more 
suitable than a geometric model. 


Solution to Exercise 4 


(a) Three assumptions are made. First, whether or not an eruption occurs 
in any particular month is independent of whether an eruption occurs 
in any other month. Second, the probability that an eruption occurs 
in any month remains constant from month to month. Finally, only 
one eruption can occur in any month. 


(b) Since the mean of a geometric distribution is equal to 1/p, an estimate 
of p is given by the reciprocal of the sample mean. So an estimate 
of p is 
p = 1/28.4 ~ 0.0352. 


(c) There are 120 months in ten years. So if X is the number of months 
between successive major explosive eruptions, then the probability 
required is 


1 
28.4 


or approximately 1 in 74. Looking at the histogram in Figure 5, you 
can see that two out of the 55 gaps exceeded ten years. This is in line 
with the 0.0136 x 120 ~ 1.6 suggested by the model. 


120 
P(X > 120) = (1 ) ~ 0.0136, 


Solution to Exercise 5 
(a) Since the mean time between major explosive eruptions is 


28.4 months, an estimate of the parameter A is 


A= : 0.0352 
— BAT i 


(b) According to the exponential model, the probability that X, the gap 
between major explosive eruptions, exceeds ten years is given by 


P(X > 120) = 1 — F(120) = e7 129/784 ~ 0.0146, 


or about 1 in 68. This probability is slightly greater than that 
obtained using the geometric model. 


(c) According to the exponential model, the proportion of gaps between 
major explosive eruptions that are between 1 and 5 years in length, 
that is, between 12 and 60 months, is (using Equation (4)) 


P(12 < X < 60) = e7 12/28.4 _ ,—60/28.4 
~ 0.5345, 


or a little more than a half. 


Solution to Exercise 6 


(a) If X represents the number of eruptions that occur in a year, then X 
has a Poisson distribution with parameter At = 0.0352 x 12 = 0.4224. 


(b) (i) The probability that exactly one eruption occurs in a year is 
given by 


P(X = 1) = 0.4224e7™4??4 ~ 0.2769. 
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(ii) The probability that there is more than one eruption in a year is 
given by 
P(X Sh H=1 (P(X =) + P(4=71)) 
= 1 — e7 9" + 0.4224) ~ 0.0677. 
(c) If T is the waiting time in months between eruptions, then T has an 
exponential distribution with parameter A = 0.0352. 


(d) The probability that the gap between successive eruptions will exceed 
five years, which is 60 months, is given by 
P(T > 60) = 1 — F(60) = 1 — (1 — e7®0352x60) 


= g Cr ee 0,1210: 


Solution to Exercise 7 


(a) If X is the number of telephone orders received in ten minutes, or 
1/6 hour, then X has a Poisson distribution with parameter 


1 
a es 
<s 


(b) (i) The probability that exactly three orders are received in 
ten minutes is given by 
a en" (2.6)? 
E 3! 
(ii) The probability that at most two telephone orders are received in 
ten minutes is 
P(X < 2) = P(X =0)+ P(X =1) + P(X =2) 


2.5)? 
=e? (1 £95 + =] ~ 0.5438. 


~ 0.2138. 


(c) If T is the waiting time in hours between telephone orders, then T has 
an exponential distribution with parameter \ = 15. 


(d) The probability that the gap between successive telephone orders will 
exceed five minutes, or 1/12 hour, is given by 


=15*1/12 
P(T > $) =1-F(})=1- (1- e712) 
=e 15x1/12 _ 671-25 ~ (0.2865. 


Solution to Exercise 8 
First, we need to find qzr such that 


q,-a_ 1 
F(q1) = bea T 


Multiplying both sides by b — a, then adding a to both sides, yields 





1 1 
qr =a + z-a) = gO + 3a). 


Similarly, 
qu-a_3 
F = =~ 
tS a 
leads to 


3 1 
qu =a + 5(b—a) = 5(3b + a). 


The interquartile range is therefore 


1 i 
qu — qL = (38 + a — b- 3a) = 7 (2b — 2a) 
= 5(b- a). 


Solution to Exercise 9 
(a) The median m is the solution of F(y) = 4. So 
F(m) = tm? =, 
and hence m? = 2, so m = 2 ~ 1.414. 
The lower quartile, qz, is the solution of F (y) = F, so 
F(q) = F(q) = F, 
and hence qz = 1. 
The upper quartile, qu, is the solution of F (y) = ł, so 
F(qu) = t(q)? = 3, 
and hence (qu)? = 3 and qu = V3 ~ 1.732. 
Therefore the interquartile range is 
qu — qL ~ 1.732 — 1 = 0.732. 


(b) The values of a = $, - and 3 and the corresponding values of m, gz 


and quy are shown in Figure 21. 




















Figure 21 Thec.d.f. F(y) = iy? showing m, qz and qu 


Solutions to exercises 


329 


Unit 5 Events occurring at random and population quantiles 


330 


Solution to Exercise 10 


(a) 


(b) 


(c) 


The a-quantile qa is the solution of F(z) = a. So 
F(qa) = (da) = a, 
and hence qa = a’/?, 


The median, lower quartile and 0.8-quantile are the a-quantiles when 
a = 0.5, 0.25 and 0.8, respectively. They are therefore given by 
m = (0.5)/8, qr = (0.25)/8 and qo.g = (0.8)1/8. 


When 8 = 3, 40.8 = (0.8)? = 0.64. 


Solution to Exercise 11 


(a) 


The c.d.f. of X is given in the table below. 
Table 16 
x 1 2 3 4 5 6 7 8 9 10 


F(z) 0.1 02 03 04 0.5 0.6 0.7 0.8 0.9 1 


The lower quartile is the smallest value x in the range of X such that 
F(x) > 0.25. From Table 16, F(2) = 0.2, which is less than 0.25, and 
F(3) = 0.3, which is greater than 0.25, so F(2) < 0.25 < F(3), and 
hence the lower quartile qz is 3. 


Similarly, F'(4) < 0.5 < F(5), so the median is 5. And 
F(T) < 0.75 < F(8), so the upper quartile is 8. 
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